ABSTRACT


Database  Tomography  (DT)  is  a  textual  database  analysis  system  consisting  of  two 
major  components:  1)  algorithms  for  extracting  multi-word  phrase  frequencies  and 
phrase  proximities  (physical  closeness  of  the  multi-word  technical  phrases)  from 
any  type  of  large  textual  database,  to  augment  2)  interpretative  capabilities  of  the 
expert  human  analyst.  DT  was  used  to  obtain  technical  intelligence  from  a 
Wireless  LAN  (Local  Area  Network)  database  derived  from  the  Science  Citation 
Index/  Social  Science  Citation  Index  (SCI).  Phrase  frequency  analysis  by  the 
technical  domain  experts  provided  the  pervasive  technical  themes  of  the  Wireless 
LAN  database,  and  the  phrase  proximity  analysis  provided  the  relationships  among 
the  pervasive  technical  themes.  Bibliometric  analysis  of  the  Wireless  LAN 
literature  supplemented  the  DT  results  with  author/  journal/  institution  publication 
and  citation  data. 

EXECUTIVE SUMMARY


A  study  was  performed  to  identify  the  structure  and  infrastructure  of  the  Wireless 
LANs  literature.  An  extensive  query  was  developed  to  retrieve  the  Wireless  LANs 
research  literature,  as  represented  in  the  Science  Citation  Index,  for  different  time 
frames.  For  this  retrieved  literature,  bibliometrics  (counting  of  papers,  citations, 
etc.)  was  performed  to  generate  the  infrastructure  (prolific  authors,  Centers  of 
Excellence,  etc.)  of  the  Wireless  LANs  literature,  and  computational  linguistics 
(grouping  of  phrases,  similar  documents)  was  performed  to  generate  the  technical 
structure  (pervasive  technical  themes,  relationships  among  themes)  of  the  Wireless 
LANs  literature.  Included  in  the  bibliometrics  results  was  identification  of  the 
seminal  papers  of  the  Wireless  LANs  literature. 

Bibliometrics  Results 


Of  the  twenty  most  prolific  authors  from  1991  to  mid-2002,  ten  are  from  Japan. 
In  fact,  twelve  are  from  Asia,  four  are  from  Europe  (Western),  and  four  are  from 
North  America.  Ten  are  from  universities,  seven  are  from  private  industry,  and 
three  are  from  research  institutes.  These  prolific  author  results  show  that  private 
industry  contains  a  substantial  portion  of  the  most  prolific  WLAN  research  and 
development  authors,  reflecting  the  more  applied  research  nature  of  much  of  the 
reported  work.  Additionally,  from  the  mid-90s  to  mid-2002,  there  was  an 
approximately  linear  increase  in  papers  published,  at  about  the  rate  of  100  papers 
per  year. 

In  an  update  of  the  most  prolific  authors  performed  immediately  prior  to 
publication  of  this  report,  it  was  shown  that,  for  the  years  2004  to  mid-2005,  all  but 
two  of  the  most  prolific  authors  had  names  of  apparent  Asian  origin.  The 
significance  of  this  finding  will  be  made  more  explicit  in  the  update  of  most 
prolific  countries. 

The  majority  of  the  journals  containing  the  most  Wireless  LANs  papers  from  1991 
to  mid-2002  are  focused  on  communications,  with  the  remainder  divided  between 
networking  and  electronics.  Bradford’s  Law  allows  journals  to  be  divided  into 
groups,  with  each  group  containing  roughly  similar  numbers  of  publications,  and 
the  ratio  of  the  number  of  journals  in  each  adjacent  group  equaling  a  constant. 
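
As a minimal sketch of the grouping described above, the following Python splits a ranked journal list into zones holding roughly equal numbers of papers, so that the ratio of journal counts between adjacent zones can be inspected. The function name bradford_zones and the paper counts are hypothetical illustrations, not data from this study.

```python
def bradford_zones(paper_counts, n_zones=3):
    """Split journals, ranked by paper count, into zones of roughly equal paper totals."""
    ranked = sorted(paper_counts, reverse=True)
    total = sum(ranked)
    zones, current, cumulative = [], [], 0
    for count in ranked:
        current.append(count)
        cumulative += count
        # Close the zone once the running total reaches the next equal-share boundary.
        # Integer arithmetic avoids floating-point boundary errors.
        if len(zones) < n_zones - 1 and cumulative * n_zones >= total * (len(zones) + 1):
            zones.append(current)
            current = []
    if current:
        zones.append(current)
    return zones


# Hypothetical paper counts per journal (not taken from the study).
example_counts = [120, 40, 40, 40, 14, 14, 14, 13, 13, 13, 13, 13, 13]
example_zones = bradford_zones(example_counts)
journals_per_zone = [len(zone) for zone in example_zones]
ratios = [b / a for a, b in zip(journals_per_zone, journals_per_zone[1:])]
print(journals_per_zone, ratios)
```

In this constructed example each zone holds 120 papers, and the number of journals grows by a constant factor from one zone to the next, which is the pattern Bradford's Law describes. Real journal distributions follow it only approximately.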

There appear to be two primary groups at the top layer. The IEEE Journal on
Selected Areas in Communications contains the most articles by far and, along with
the Journal of the IEICE Transactions on Communications and IEEE Transactions
on Vehicular Technology, constitutes the first group. The next group consists of
Electronics Letters, IEEE Transactions on Communications, IEEE
Communications Magazine, IEICE Transactions on Fundamentals of Electronics
Communications and Computer Sciences, Microwaves & RF, IEEE Personal
Communications, and Wireless Networks.

While  IEEE  Transactions  on  Vehicular  Technology  may  seem  out  of  place,  this 
journal  publishes  many  papers  dealing  with  electrical  and  electronics  technology  in 
vehicles  and  vehicular  systems.  The  areas  of  communications,  transportation 
systems,  and  vehicular  electronics  define  its  scope. 

The journals were also updated for 2004 to mid-2005. Of the 21 journals pre-2003,
and the 19 journals post-2003 listed, ten were in common. Of those not
shared,  many  of  the  recent  papers  appear  in  journals  whose  titles  suggest  a  wireless 
focus  and  a  mobile  networking  focus.  Because  of  the  technology  specificity,  these 
more  recent  papers  have  even  more  of  an  applications  focus  than  what  was 
perceived  as  an  already  quite  applied  discipline. 

Of  the  twenty  most  prolific  institutions  from  1991  to  mid-2002,  ten  are  from  Asia, 
five  are  from  Western  Europe,  four  from  the  USA,  and  one  from  Eastern  Europe. 
Ten  are  universities,  eight  are  privately  owned  businesses  and  the  remaining 
institutions  are  research  institutes.  These  distributions  reflect  those  of  the  most 
prolific  authors,  and  confirm  the  substantial  contribution  of  the  private  sector  in  the 
time  frame  identified. 

The institutions were also updated for 2004 to mid-2005. For the 2004 to mid-2005
results, the most prolific institutions shifted from mainly USA institutions to mainly
Asian institutions. Also, none of the industrial organizations (that were so prominent in the
1991  to  mid-2002  results)  appear  in  the  latest  results.  This  absence  of  industry  can 
be  seen  more  dramatically  by  examining  the  results  from  the  period  1991-1995 
only.  Over  25%  of  the  prolific  institutions  in  this  early  time  frame  are  industrial 
organizations. 

There  appear  to  be  three  dominant  groups  in  the  twenty  most  prolific  countries 
from  1991  to  mid-2002.  The  USA  is  in  a  group  of  its  own,  with  more  than  twice 
the  number  of  papers  as  the  next  most  prolific  country.  Japan  constitutes  the 
second  most  dominant  group  with  substantially  more  papers  than  the  third  most 
prolific  country.  The  sum  of  papers  from  the  US  and  Japan  almost  surpasses  the 
cumulative sum of the next eighteen most prolific countries. South Korea,
Italy, Germany, Taiwan, England and Canada make up the third most prolific
group. Of the eleven most prolific countries, five are Asian, confirming the
prolific author and institution results. With the exception of perhaps China, the
developing  nations  are  playing  almost  no  role  in  the  literature  contribution  of 
Wireless  LANs. 

Temporal  trends  of  country  publications  from  1991  to  mid-2002  show  that  there 
has  been  a  dramatic  increase  in  the  number  of  publications  that  different  countries 
have  produced,  dealing  with  Wireless  LANs.  The  most  notable  increase  has  been 
within  the  United  States.  Japan,  having  less  of  an  increase  than  the  United  States, 
still  has  had  an  increase  in  publications  that  well  surpasses  any  of  the  other 
countries  over  the  1991  to  mid-2002  time  period. 

In  terms  of  absolute  numbers  of  co-authored  papers  from  1991  to  mid-2002,  the 
USA  major  partners  were  Italy,  Japan,  Canada,  and  South  Korea.  Of  the  top  15 
countries  publishing  Wireless  LAN  papers,  most  did  not  have  a  substantial  amount 
of  inter-country  published  papers.  In  cases  where  there  was  inter-country 
publishing,  it  typically  was  with  countries  in  similar  geographical  regions.  The  US 
was  perhaps  the  exception  -  co-publishing  with  many  other  countries  from  various 
geographical  regions. 

The  country  results  were  also  updated  for  2004  to  mid-2005.  There  has  been  a 
major  shift  in  country  emphasis  in  the  past  decade.  Except  for  the  USA,  the 
Western  nations’  leadership  has  been  replaced  by  that  of  the  Asian  nations. 
Especially  large  advances  have  been  made  by  South  Korea  (7%  of  USA  paper 
production  a  decade  ago  to  43%  of  USA  production  presently),  Taiwan  (3%  of 
USA to 29% of USA), and People's Republic of China (1% of USA to 25% of
USA). 

Of  the  twenty  most  cited  first  authors  for  the  1991  to  mid-2002  database,  seventeen 
are  from  North  America  (USA),  two  are  from  Asia  (Japan),  and  one  is  from 
Europe  (Italy).  This  is  a  far  different  distribution  from  the  most  prolific  authors, 
where  twelve  were  from  Asia  (Far  East).  There  are  a  number  of  potential  reasons 
for  this  difference,  including  difference  in  quality  and  late  entry  into  the  Wireless 
LANs  research  discipline.  In  another  three  or  four  years  when  the  papers  from 
present-day  authors  have  accumulated  sufficient  citations,  firmer  conclusions  about 
quality  can  be  drawn. 

The  lists  of  twenty  most  prolific  authors  and  twenty  most  highly  cited  first  authors 
for  the  1991  to  mid-2002  database  had  only  two  names  in  common  (ADACHI, 
ZORZI).  This  phenomenon  of  minimal  intersection  has  been  observed  in  all  other 
text  mining  studies  performed  by  the  first  author  of  the  present  report.  The  time 
frame of interest for most prolific authors is present time, whereas the time frame
of  interest  for  most  cited  can  span  many  decades.  Researchers  who  may  very  well 
have  been  prolific  when  their  most  citable  work  was  done  may  no  longer  be 
prolific.  Finally,  many  prolific  authors,  especially  at  academic  institutions,  tend 
not  to  be  first  authors  in  most  of  their  publications.  Therefore,  they  tend  to  be 
severely  under-represented  in  the  SCI  citation  downloads,  which  contain  first 
authors  only. 

Twelve  of  the  top  twenty  cited  authors’  institutions  are  universities,  and  five  are 
private  companies.  This  differs  a  bit  from  the  most  prolific  authors  in  that  the 
distribution  between  universities  and  privately  owned  businesses  is  ten  to  seven, 
respectively. 

The  main  focal  points  of  the  majority  of  the  most  highly-cited  papers  from  the 
1991  to  mid-2002  database  are  CDMA  (Code-Division  Multiple  Access)  and  other 
protocols  within  Wireless  LANs.  There  are  a  number  of  other  topics  addressed 
within  the  papers  as  well,  including  data  transmission  and  throughput  in  ATM 
(Asynchronous  Transfer  Mode)  networks  and  Spread  Spectrum  technology, 
Network  Architecture,  and  a  broader  group  that  focuses  on  different  applications 
for  Wireless  Networks.  These  focal  points  correlate  well  with  the  themes  resulting 
from  the  computational  linguistics  (clustering),  as  will  be  shown  later. 

The  list  of  most  cited  journals  from  the  1991  to  mid-2002  database  is  very  similar 
to  the  list  of  most  prolific  journals,  although  some  items  are  not  in  the  same  order. 

Computational  Linguistics  Results 

The  technical  structure  of  the  Wireless  LANs  literature  is  presented  as  a 
hierarchical  taxonomy.  For  taxonomy  generation,  a  combined  factor  matrix  and 
multi-link  word  clustering  process  was  developed  and  used  for  the  first  time.  The 
factor  matrix  served  to  filter  the  words  input  to  the  clustering  algorithm,  identified 
the context-dependent trivial words to be excluded, and identified the context-dependent
words that could be conflated. In addition, document clustering
algorithms  based  on  Greedy  String  Tiling  text  similarity  quantification  and 
Partitional  Document  Clustering  (Objective  Function  Optimization)  were 
developed  and  used  for  the  first  time. 
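
To illustrate the kind of text similarity quantification that Greedy String Tiling provides, the following is a simplified Python sketch. It repeatedly finds the longest common runs of unmarked tokens in two texts, marks them as tiles, and scores similarity by the fraction of tokens covered. It is an assumption-laden illustration rather than the implementation used in this study: the function names, the minimum match length of two tokens, and the normalization are hypothetical choices, and the quadratic search omits the hashing speed-ups used in production versions.

```python
def greedy_string_tiling(a, b, min_match=2):
    """Return tiles (start_a, start_b, length) shared by token lists a and b.

    Assumes min_match >= 1.
    """
    marked_a = [False] * len(a)
    marked_b = [False] * len(b)
    tiles = []
    while True:
        max_len = min_match
        matches = []
        # Find all maximal matches of unmarked tokens with the greatest length.
        for i in range(len(a)):
            for j in range(len(b)):
                k = 0
                while (i + k < len(a) and j + k < len(b)
                       and a[i + k] == b[j + k]
                       and not marked_a[i + k] and not marked_b[j + k]):
                    k += 1
                if k > max_len:
                    matches = [(i, j, k)]
                    max_len = k
                elif k == max_len:
                    matches.append((i, j, k))
        if not matches:
            break
        for i, j, k in matches:
            # Skip a match that overlaps a tile laid earlier in this pass.
            if any(marked_a[i:i + k]) or any(marked_b[j:j + k]):
                continue
            for offset in range(k):
                marked_a[i + offset] = True
                marked_b[j + offset] = True
            tiles.append((i, j, k))
    return tiles


def gst_similarity(text_a, text_b, min_match=2):
    """Similarity in [0, 1]: share of tokens in both texts covered by tiles."""
    a, b = text_a.lower().split(), text_b.lower().split()
    if not a or not b:
        return 0.0
    covered = sum(length for _, _, length in greedy_string_tiling(a, b, min_match))
    return 2 * covered / (len(a) + len(b))


# Hypothetical phrases, used only to exercise the functions.
print(gst_similarity("power control for cdma systems",
                     "cdma systems with fast power control"))
```

A pairwise similarity of this kind can then feed a document clustering step, since documents sharing long runs of identical text score close to one and unrelated documents score close to zero.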

The  major  total  database  theme  (1991  to  mid-2002  records)  is  research  on  methods 
to  improve  throughput  while  maintaining  Quality-of-Service.  In  all  three 
clustering  approaches  used  in  this  study,  the  top-level  categorization  appeared  to 
consist of two moderately distinct categories. One (Improved Throughput from
Physical  Layer  Advances)  addresses  digital  spread  spectrum  modulation 
techniques  (CDMA)  to  increase  throughput  and  user  capacity,  and  the  associated 
efforts  at  the  physical  layer  to  reduce  inherent  interference  limitations  of  CDMA 
on  system  capacity.  The  other  (Improved  Throughput  from  Protocols  and 
Architectures)  covers  Network  Architecture  and  Applications  designed  to  improve 
throughput,  including  infrastructure,  protocols,  hardware  and  applications. 

Level  1 

IMPROVED  THROUGHPUT  FROM  PHYSICAL  LAYER  ADVANCES 

Reduction  of  multiple  access  interference  suppression  in  the  spatial  domain 
and  in  the  code  domain  for  CDMA  systems,  and  physical  approaches  to 
reduce  fading  losses  and  restore  signal  degradation  that  occurred  at  the 
physical  layer. 

IMPROVED  THROUGHPUT  FROM  PROTOCOLS  AND 
ARCHITECTURES 

Protocols  for  mobile  wireless  voice  and  data  traffic  ATM  networks  to  avoid 
packet  collisions,  reduce  call  dropping,  reduce  delays,  and  increase 
throughput  while  maintaining  Quality  of  Service. 

At  the  next  categorization  level,  four  categories  can  be  discerned.  These  categories 
(underlined),  and  their  key  themes  (bulletized)  are  as  follows: 

Level  2 

SIGNAL  QUALITY  MAINTENANCE 

Signal  degradation  at  the  physical  layer,  and  use  of  electronics  technology  to 
restore  and  maintain  signal  quality. 

•  Optical  wave  reflections  off  walls  and  columns  inside  buildings,  and 
propagation  measurements 

•  Antennas,  especially  micro-strip  patch  and  other  arrays,  and  simulated/ 
measured  beam  radiation  patterns  for  reduced  electromagnetic  interference 

• Power amplifiers in the GHz frequency range for wireless LANs, especially
those using CMOS hardware or circuitry

CDMA  INTERFERENCE  REDUCTION 

Reduction of multiple access interference suppression in the spatial domain
and in the code domain for CDMA systems, and physical approaches to
reduce fading losses

• OFDM (Orthogonal Frequency-Division Multiplexing), especially degradation
from frequency offsets, and symbol modification to cut down on ISI (Inter-Symbol
Interference) while benefiting from reduced path loss

• Use of DS/CDMA (Direct-Sequence/Code-Division Multiple Access) systems
that improve BER (Bit Error Rate) within Rayleigh multi-path fading channels.

• Diversity, by the use of antenna diversity reception and coherent Rake
combiners, in spread-spectrum communications over fast-fading multi-path
channels for increasing throughput

•  Channel  estimation  for  interference-limited  channels,  and  multi-user  detectors 
for  co-channel  interference  cancellation 

•  CDMA  (Code-Division  Multiple  Access)  to  increase  system  capacity  by 
reducing  interference  from  other  users  in  a  cell 

•  Use  of  signal-to-interference  power  ratio  (SIR)  measurement  based  fast  transmit 
power  control  (TPC)  to  reduce  the  minimum  transmit  power  according  to  the 
channel  load  and  the  changes  in  the  link  conditions  due  to  fading 

PACKET  TRAFFIC  PROTOCOLS 

Access protocols for voice and data traffic that increase throughput and
reduce delays by packet collision avoidance (e.g., CSMA).

•  TCP  (Transmission  Control  Protocols)  employing  ARQ/FEC  (Automatic 
Repeat  Request/Forward  Error  Correction)  protocols,  to  reduce  packet  error 

• MAC (Medium Access Control) Protocols, especially TDMA-based, that
efficiently  accommodate  real-time  voice  and  video  traffic  with  dynamic 
bandwidth/  resource  allocation 

•  Access  protocols  for  integrated  voice  and  data  traffic  that  overcome  the  heavy 
load  degraded  performance  due  to  the  contention  mechanism  of  random  access 
in  PRMA  (Packet  Reservation  Multiple  Access)  protocols. 

• CSMA (Carrier-Sense Multiple Access) protocols that increase throughput by
focusing  on  packet  collision  avoidance 

NETWORK  ARCHITECTURES 

Architectures and routing protocols for mobile ad hoc wireless ATM
networks to improve Quality of Service.

•  Standard  IEEE  802.11  MAC  (Medium  Access  Control)  protocols  for  wireless 
LANs 

• Standard Bluetooth IEEE 802.11 wireless LAN devices and technology

•  Providing  Internet  services,  with  enhanced  performance  Internet  protocols  over 
wireless  networks,  to  mobile  communications  systems  users 

•  Architectures  maintaining  guaranteed  levels  of  QoS  (Quality  of  Service)  within 
wireless  mobile  ATM  (Asynchronous  Transfer  Mode)  networks 

•  Handoffs  and  connection  path  rerouting  to  avoid  cell  loss  to  mobile  users  in 
ATM  (Asynchronous  Transfer  Mode)  networks 

•  Call  handoff  management  schemes  to  minimize  dropping  probability  of  handoff 
calls  and  blocking  probability  of  new  calls 

•  Routing  protocols  for  wireless  ad  hoc  mobile  networks 

•  Multicasting,  especially  in  mobile  ad  hoc  networks  with  multiple  hosts,  and  the 
use  of  multicast  delivery  trees  under  mobile  host  conditions 

Approximately 46% of the CLUTO (the software package for generating the
partitional document clusters) cluster documents are assigned by the clustering
algorithm to the first higher level category, Improved Throughput from Physical
Layer Advances, while the other 54% are assigned to the second higher level
category,  Improved  Throughput  from  Protocols  and  Architectures.  The  category 
Improved  Throughput  from  Physical  Layer  Advances  breaks  into  two  second  level 
categories,  with  30%  of  its  documents  assigned  to  the  category  Signal  Quality 
Maintenance,  and  70%  assigned  to  category  CDMA  Interference  Reduction.  The 
other  first  level  cluster,  Improved  Throughput  from  Protocols  and  Architectures, 
also  divides  into  two  second  level  groups,  with  36%  of  its  documents  assigned  to 
category  Packet  Traffic  Protocols,  and  64%  of  its  records  assigned  to  category 
Network  Architectures. 
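
Because the second level percentages above are stated relative to their parent category, the share of the whole clustered document set in each second level category is the product of the two percentages. The short Python sketch below works through that arithmetic using the figures quoted in the text. The variable and function names are illustrative, and since the source percentages are rounded, the results are approximate.

```python
# Percentages as reported in the text; second level values are relative to the parent.
taxonomy = {
    "Improved Throughput from Physical Layer Advances": (46, {
        "Signal Quality Maintenance": 30,
        "CDMA Interference Reduction": 70,
    }),
    "Improved Throughput from Protocols and Architectures": (54, {
        "Packet Traffic Protocols": 36,
        "Network Architectures": 64,
    }),
}


def overall_shares(tree):
    """Return each second level category's share of all clustered documents, in percent."""
    return {
        child: parent_pct * child_pct / 100
        for parent_pct, children in tree.values()
        for child, child_pct in children.items()
    }


for name, pct in overall_shares(taxonomy).items():
    print(f"{name}: {pct:.1f}%")
```

On these figures, CDMA Interference Reduction and Network Architectures each account for roughly one third of the clustered documents, Packet Traffic Protocols for roughly one fifth, and Signal Quality Maintenance for roughly one seventh.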

Finally,  the  systematic  method  of  Citation-Assisted  Background  was  used  to 
identify  the  seminal  literature  of  Wireless  LANs.  The  most  cited  references  in  the 
retrieved  Wireless  LANs  database  covering  1991  to  mid-2005  are  tabulated  in 
order  of  inverse  frequency.  Documents  with  citations  above  some  threshold 
frequency  are  tagged.  Then,  the  table  is  re-ordered  chronologically.  The  early 
historical  documents  with  citation  frequencies  substantially  larger  than  those  of 
their  contemporaries  are  selected,  as  are  the  extremely  recent  documents  with 
citation  frequencies  substantially  larger  than  those  of  their  contemporaries.  By 
contemporaries,  it  is  meant  documents  published  in  the  same  time  frame,  not 
limited  to  the  same  year  (see  next  paragraph  for  examples  of  how  we  implement 
‘same  time  frame’).  Then,  the  dynamic  selection  process  defined  above  is  applied 
to  the  early  historical  references,  the  intermediate  time  references  (those  falling 
under  the  high  frequency  threshold),  and  the  extremely  recent  references 
(approximately  two  years  or  less). 
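
The selection logic described above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the procedure as actually implemented: 'contemporaries' are taken to be references published within a fixed window of years, 'substantially larger' is taken to mean at least a fixed multiple of the contemporaries' median citation count, and a reference with no contemporaries is retained. The function name, the window, the multiple, and the reference data are all hypothetical.

```python
from statistics import median


def select_seminal(references, window=5, factor=3.0):
    """Pick references cited far more often than others from the same time frame.

    references: list of (label, year, citations) tuples.
    Returns the selected labels in chronological order.
    """
    chronological = sorted(references, key=lambda ref: ref[1])
    selected = []
    for label, year, cites in chronological:
        # Contemporaries: other references published within `window` years.
        peers = [c for other, y, c in chronological
                 if other != label and abs(y - year) <= window]
        if not peers or cites >= factor * median(peers):
            selected.append(label)
    return selected


# Hypothetical references (label, publication year, citation count).
example_refs = [
    ("ref_A", 1950, 40), ("ref_B", 1952, 5), ("ref_C", 1953, 6),
    ("ref_D", 1990, 120), ("ref_E", 1991, 30), ("ref_F", 1992, 35),
    ("ref_G", 2004, 12), ("ref_H", 2004, 3), ("ref_I", 2005, 2),
]
print(select_seminal(example_refs))
```

Comparing each reference with its own time frame, rather than with a single global threshold, is what allows a very recent document with a modest absolute citation count (ref_G in this example) to be selected alongside heavily cited older work.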

There  were  a  total  of  133  documents  selected  as  seminal.  While  the  earliest 
seminal  documents  in  the  physical  sciences  literatures  the  first  author  has 
examined  previously  were  published  in  the  18th  and  19th  centuries,  the  earliest 
seminal  document  identified  in  the  present  Wireless  LANs  analysis  is  Shannon’s 
classic  1948  paper(s)  on  the  mathematical  theory  of  communication. 

Although  the  journals  associated  with  these  seminal  papers  cover  many  aspects  of 
communication,  three  stand  out  in  terms  of  numbers  of  seminal  papers  published. 
From  1948-1969  (with  an  outlier  in  1996),  the  Bell  System  Technical  Journal 
published  eight  of  these  seminal  papers.  From  1975-1994  (with  an  outlier  in 
1998),  the  IEEE  Transactions  on  Communications  published  19  of  these  seminal 
papers. From 1993-2000 (with an outlier in 1987), the IEEE Journal on Selected
Areas  in  Communications  published  15  of  these  seminal  papers. 

Each  of  these  three  journals  commanded  essentially  a  separate  time  frame,  with 
each  time  frame  appearing  approximately  sequentially. 

1. INTRODUCTION


Science  and  technology  are  assuming  an  increasingly  important  role  in  the  conduct 
and  structure  of  domestic  and  foreign  business  and  government.  In  the  highly 
competitive  civilian  and  military  worlds,  there  has  been  a  commensurate  increase 
in the need for scientific and technical intelligence to ensure that one's perceived
adversaries do not gain an overwhelming advantage in the use of science and
technology. While there is no substitute for direct human intelligence gathering,
many techniques have become available that can support and complement it.
In  particular,  techniques  that  identify,  select,  gather,  cull,  and  interpret  large 
amounts  of  technological  information  semi-automatically  can  expand  greatly  the 
capabilities  of  human  beings  in  performing  technical  intelligence. 

One  such  technique  is  DT  [Kostoff  1993,  1994,  1995],  a  system  for  analyzing  large 
amounts  of  textual  computerized  material.  It  includes  algorithms  for  extracting 
multi-word  phrase  frequencies  and  phrase  proximities  from  the  textual  databases, 
coupled  with  the  topical  expert  human  analyst  to  interpret  the  results  and  convert 
large  volumes  of  disorganized  data  to  ordered  information.  Phrase  frequency 
analysis  (occurrence  frequency  of  multi-word  technical  phrases)  provides  the 
pervasive  technical  themes  of  a  database,  and  the  phrase  proximity  (physical 
closeness  of  the  multi-word  technical  phrases)  analysis  provides  the  relationships 
among  pervasive  technical  themes,  as  well  as  among  technical  themes  and  authors/ 
journals/  institutions/  countries,  etc.  The  present  report  describes  use  of  the  DT 
process,  supplemented  by  literature  bibliometric  analyses,  to  derive  technical 
intelligence  from  the  published  literature  of  Wireless  LAN  technology. 
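
The two algorithmic components named above, phrase frequency and phrase proximity, can be illustrated with a minimal Python sketch. It is not the DT software: the function names, the two-word phrase length, the window of three word positions, and the sample sentences are assumptions made for illustration, and steps such as trivial-phrase filtering and expert interpretation are omitted.

```python
import re
from collections import Counter


def ngrams(text, n):
    """Return the n-word phrases of a text, in order of appearance."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def phrase_frequencies(texts, n=2):
    """Count how often each n-word phrase occurs across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(ngrams(text, n))
    return counts


def phrases_near(texts, theme, n=2, window=3):
    """Count n-word phrases starting within `window` words of each theme occurrence."""
    theme = " ".join(re.findall(r"[a-z0-9]+", theme.lower()))
    theme_len = len(theme.split())
    near = Counter()
    for text in texts:
        grams = ngrams(text, n)
        # Index in this list equals the word position where the phrase starts.
        for pos, gram in enumerate(ngrams(text, theme_len)):
            if gram == theme:
                lo, hi = max(0, pos - window), pos + window + 1
                near.update(g for g in grams[lo:hi] if g != theme)
    return near


# Hypothetical abstract sentences, used only to exercise the functions.
sample_texts = [
    "Fast transmit power control reduces interference in CDMA systems.",
    "Power control and channel estimation improve CDMA systems capacity.",
]
print(phrase_frequencies(sample_texts).most_common(2))
print(phrases_near(sample_texts, "CDMA systems").most_common(3))
```

High-frequency phrases from the first function suggest candidate themes, and the second function shows which other phrases tend to sit physically close to a chosen theme. The analyst then judges which of these co-occurrences are technically meaningful.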

Wireless  LANs,  as  defined  by  the  authors  for  this  study,  allow  mobile  or  stationary 
users  to  connect  to  a  local  area  network  (LAN)  through  wireless  radio  or  infrared 
connections.  Since  one  of  the  key  outputs  of  the  present  study  is  a  query  that  can 
be  used  by  the  community  to  access  relevant  Wireless  LANs  documents,  a 
recommended  query  based  on  this  study  is  presented  in  total.  This  query  serves  as 
the  operational  definition  of  Wireless  LANs,  and  its  development  is  discussed  in 
detail  in  the  database  generation  section. 

WIRELESS  LANs  QUERY 

(WIRELESS SAME (LAN OR LANS OR IP OR HIPERLAN OR SPREAD
SPECTRUM OR SPREAD-SPECTRUM OR MEDIUM ACCESS CONTROL OR
CDMA OR PROTOCOL* OR ATM OR TDMA OR OFDM OR
ASYNCHRONOUS TRANSFER MODE OR MODEM OR MANET OR LOCAL
AREA NETWORK* OR MULTIPATH OR CSMA)) OR WLAN* OR (MOBIL*
SAME (LAN OR LANS OR MEDIUM ACCESS CONTROL OR INTERNET
PROTOCOL OR ATM OR RSVP OR TCP IP OR CDMA OR CSMA)) OR (MAC
SAME CSMA) OR AD HOC NETWORK* OR BLUETOOTH OR CSMA CA
OR CSMA-CA OR CSMA CD OR CSMA-CD OR HIPERLAN OR
DISTRIBUTED COORDINATION FUNCTION OR WIRELESS PERSONAL
AREA NETWORK* OR WPAN OR WPANS OR IEEE 802.11

To  execute  the  study  reported  in  this  paper,  a  database  of  relevant  Wireless  LAN 
articles  is  generated  using  the  iterative  search  approach  of  Simulated  Nucleation 
[Kostoff et al, 1997a, 2001a]. Then, the database is analyzed to produce the
following  characteristics  and  key  features  of  the  Wireless  LAN  field:  recent 
prolific  Wireless  LAN  authors;  journals  that  contain  numerous  Wireless  LAN 
papers;  institutions  that  produce  numerous  Wireless  LAN  papers;  keywords  most 
frequently  specified  by  the  Wireless  LAN  authors;  authors,  papers  and  journals 
cited  most  frequently;  pervasive  technical  themes  of  Wireless  LANs;  and 
relationships  among  the  pervasive  themes  and  sub-themes. 

What  is  the  importance  of  applying  DT  and  bibliometrics  to  a  topical  field  such  as 
Wireless  LANs?  The  roadmap,  or  guide,  of  this  field  produced  by  DT  and 
bibliometrics  provides  the  demographics  and  a  macroscopic  view  of  the  total  field 
in  the  global  context  of  allied  fields.  This  allows  specific  starting  points  to  be 
chosen  rationally  for  more  detailed  investigations  into  a  specific  topic  of  interest. 
DT  and  bibliometrics  do  not  obviate  the  need  for  detailed  investigation  of  the 
literature  or  interactions  with  the  main  performers  of  a  given  topical  area  in  order 
to  make  a  substantial  contribution  to  the  understanding  or  the  advancement  of  this 
topical  area,  but  allow  these  detailed  efforts  to  be  executed  more  efficiently.  DT 
and  bibliometrics  are  quantity-based  measures  (number  of  papers  published, 
frequency  of  technical  phrases,  etc.),  and  correlations  with  intrinsic  quality  are  less 
direct.  The  direct  quality  components  of  detailed  literature  investigation  and 
interaction  with  performers,  combined  with  the  DT  and  bibliometrics  analysis,  can 
result  in  a  product  highly  relevant  to  the  user  community. 

2.  BACKGROUND 

2.1 Text Mining

Text  mining  is  the  extraction  of  useful  information  from  large  volumes  of  text.  Its 
component  capabilities  of  computational  linguistics  and  bibliometrics  were  the 
main  analytical  techniques  used  for  the  present  study,  and  these  capabilities  can  be 
summarized  as  follows. 

Science and technology (S&T) computational linguistics [Kostoff, 2003a; Hearst,
1999;  Zhu  and  Porter,  2002;  Losiewicz,  2000]  is  a  process  that  underlies  the 
extraction  of  useful  information  from  large  volumes  of  technical  text.  It  identifies 
pervasive  technical  themes  in  large  databases  from  technical  phrases  that  occur 
frequently.  It  also  identifies  relationships  among  these  themes  by  grouping 
(clustering)  these  phrases  (or  their  parent  documents)  on  the  basis  of  similarity. 
Computational  linguistics  can  be  used  for: 

•  Enhancing  information  retrieval  and  increasing  awareness  of  the  global 
technical  literature  [Kostoff  et  al,  1997a;  Greengrass,  1997;  TREC,  2003] 

•  Potential  discovery  and  innovation  based  on  merging  common  linkages  among 
very  disparate  literatures  [Swanson,  1986;  Swanson  and  Smalheiser,  1997; 
Kostoff,  2003b;  Gordon  and  Dumais,  1998] 

•  Uncovering  unexpected  asymmetries  from  the  technical  literature  [Goldman  et 
al, 1999; Kostoff, 2003c]

•  Estimating  global  levels  of  effort  in  S&T  sub-disciplines  [Kostoff  et  al,  2000a, 
2004a;  Viator  and  Pestorius,  2001] 

•  Helping  authors  potentially  increase  their  citation  statistics  by  improving  access 
to  their  published  papers,  and  thereby  potentially  helping  journals  to  increase 
their  Impact  Factors  [Kostoff  et  al,  2004a,  2004b] 

•  Tracking  myriad  research  impacts  across  time  and  applications  areas  [Davidse 
and Van Raan, 1997; Kostoff et al, 2001b]

Evaluative  bibliometrics  [Narin,  1976;  Garfield,  1985;  Schubert  et  al,  1987]  uses 
counts  of  publications,  patents,  citations  and  other  potentially  informative  items  to 
develop  science  and  technology  performance  indicators.  Its  validity  is  based  on 
the  premises  that  1)  counts  of  patents  and  papers  provide  valid  indicators  of  R&D 
activity  in  the  subject  areas  of  those  patents  or  papers,  2)  the  number  of  times  those 
patents  or  papers  are  cited  in  subsequent  patents  or  papers  provides  valid  indicators 
of  the  impact  or  importance  of  the  cited  patents  and  papers,  and  3)  the  citations 
from  papers  to  papers,  from  patents  to  patents  and  from  patents  to  papers  provide 
indicators  of  intellectual  linkages  between  the  organizations  which  are  producing 
the  patents  and  papers,  and  knowledge  linkage  between  their  subject  areas  [Narin 
et al, 1994]. Evaluative bibliometrics can be used to:

•  Identify  the  infrastructure  (authors,  journals,  institutions)  of  a  technical  domain, 

•  Identify  experts  for  innovation-enhancing  technical  workshops  and  review 
panels, 

• Develop site visitation strategies for assessment of prolific organizations
globally, 

•  Identify  impacts  (literature  citations)  of  individuals,  research  units, 
organizations,  and  countries 

A  typical  text  mining  study  of  the  published  literature  develops  a  query  for 
comprehensive  information  retrieval,  processes  the  retrieved  database  using 
computational  linguistics  and  bibliometrics,  and  integrates  the  processed 
information. 

2.2  Unique  and  Advanced  Study  Features 

The  study  reported  in  the  present  paper  is  in  the  journal  article  abstract  category.  It 
differs  from  the  previous  published  papers  in  this  category  [Kostoff  et  al,  1997b, 
1998a,  1999a,  2000a,  2000b,  2002,  2004a,  2004b]  in  three  respects.  First,  the 
topical  domain  (Wireless  LANs)  is  completely  different.  Second,  document 
clustering  techniques  for  theme  categorization,  based  on  Greedy  String  Tiling  (see 
Appendix  1  for  more  details)  [Wise,  1992]  and  Clustering  Objective  Function 
Optimization  (see  Appendix  2  for  more  details)  [Zhao  and  Karypis,  2004]  for  text 
similarity,  were  developed  and  included,  to  complement  the  word/  concept 
clustering  approach.  Third,  factor  matrix  filtering  was  developed  and  used  to 
select  context-dependent  words  for  input  to  the  clustering  algorithm,  thereby 
leading  to  more  sharply  defined  clusters.  In  addition,  bibliometric  clustering  is 
presented  for  two  database  fields:  authors  and  countries.  Finally,  the  marginal
utility  algorithm  for  query  term  truncation  was  applied,  allowing  only  the  highest 
payoff  terms  to  be  included  in  the  final  query,  and  resulting  in  an  efficient  query. 

3.  DATABASE  GENERATION 

The  key  step  in  the  Wireless  LAN  literature  analysis  is  the  generation  of  the 
database  to  be  used  for  processing.  There  are  three  main  elements  to  database 
generation:  the  overall  objectives,  the  approach  selected,  and  the  database  used. 
Each  of  these  elements  is  described. 

3.1  Overall  Study  Objectives 

The  main  objective  was  to  identify  global  S&T  that  had  both  direct  and  indirect 
relations  to  Wireless  LANs.  A  sub-objective  was  to  estimate  the  overall  level  of 
global  effort  in  Wireless  LAN  technology,  as  reflected  by  the  emphases  in  the 
published  literature. 

3.2  Databases  and  Approach


For  the  present  study,  the  SCI  database  (including  both  the  Science  Citation  Index 
and  the  Social  Science  Citation  Index)  was  used.  The  approach  used  for  query 
development  was  the  DT-based  iterative  relevance  feedback  concept  [Kostoff  et  al, 
1997a].

3.2.1  Science  Citation  Index/  Social  Science  Citation  Index  (SCI)  [SCI,  2002] 

The  retrieved  database  used  for  analysis  consists  of  selected  journal  records 
(including  the  fields  of  authors,  titles,  journals,  author  addresses,  author  keywords, 
abstract  narratives,  and  references  cited  for  each  paper)  obtained  by  searching  the 
Web  version  of  the  SCI  for  Wireless  LAN  articles.  At  the  time  the  final  data  was 
extracted  for  the  baseline  analysis  in  the  present  report  (mid-2002),  the  version  of 
the  SCI  used  accessed  about  5600  journals  (mainly  in  physical,  engineering,  and 
life  sciences  basic  research)  from  the  Science  Citation  Index,  and  over  1700 
journals  from  the  Social  Science  Citation  Index. 

The  SCI  database  selected  represents  a  fraction  of  the  available  Wireless  LAN 
(mainly  research)  literature,  which  in  turn  represents  a  fraction  of  the  Wireless  LAN
S&T  actually  performed  globally  [Kostoff,  2000c].  The  articles  contained  within
the  SCI  database  do  not  include  the  large  body  of  classified  literature,  or  company 
proprietary  technology  literature,  although  the  SCI  articles  could  reference  these 
literatures.  The  SCI  articles  do  not  include  technical  reports,  books,  or  patents  on 
Wireless  LANs,  but  could  again  reference  these  literatures.  For  the  computational 
linguistics  and  much  of  the  bibliometrics,  the  SCI  covers  a  finite  slice  of  time 
(1999  to  mid-2002).  The  database  used  represents  the  bulk  of  the  peer-reviewed 
high  quality  Wireless  LAN  research  literature,  and  is  a  representative  sample  of  all 
Wireless  LAN  research  in  recent  times. 

To  extract  the  relevant  articles  from  the  SCI,  the  Title,  Keyword,  and  Abstract 
fields  were  searched  using  Keywords  relevant  to  Wireless  LANs.  The  resultant 
Abstracts  were  culled  to  those  relevant  to  Wireless  LANs.  The  search  was 
performed  with  the  aid  of  two  powerful  DT  tools  (multi-word  phrase  frequency 
analysis  and  phrase  proximity  analysis)  using  the  process  of  Simulated  Nucleation 
[Kostoff  et  al,  1997a].

An  initial  query  of  Wireless  LAN-related  terms  produced  two  groups  of  papers: 
one  group  was  judged  by  domain  experts  to  be  relevant  to  the  subject  matter,  the 
other  was  judged  to  be  non-relevant.  Gradations  of  relevancy  or  non-relevancy
were  not  considered.  An  initial  database  of  Titles,  Keywords,  and  Abstracts  was 
created  for  each  of  the  two  groups  of  papers.  Phrase  frequency  and  proximity 
analyses  were  performed  on  this  textual  database  for  each  group.  The  high 
frequency  single,  double,  and  triple  word  phrases  characteristic  of  the  relevant 
group,  and  their  Boolean  combinations,  were  then  added  to  the  query  to  expand  the
papers  retrieved.  Clustering  of  phrases  into  thematic  categories  was  performed  to
help  guide  the  selection  of  phrases.  Phrases  from  each  of  the  thematic  categories
were  selected  to  ensure  balanced  category  representation  from  the  complete  sample
of  relevant  records.  Similar  phrases  characteristic  of  the  non-relevant  group  are 
ordinarily  subtracted  from  the  query  to  contract  the  papers  retrieved,  but  the  terms 
used  in  the  present  study  were  sufficiently  focused  that  all  documents  retrieved 
were  relevant.  The  process  was  repeated  on  the  new  database  of  Titles,  Keywords, 
and  Abstracts  obtained  from  the  search.  A  few  more  iterations  were  performed 
until  the  number  of  records  retrieved  stabilized  (convergence).  The  final  phrase- 
based  query  used  for  the  Wireless  LAN  study  was  shown  in  the  Introduction. 
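
As a rough illustration of the phrase-frequency step described above, the following Python sketch counts single-, double-, and triple-word phrases in two groups of records and lists the phrases that are frequent in the relevant group but absent from the non-relevant group. It is a minimal sketch under stated assumptions, not the DT algorithm itself: the function names, the tokenization rule, the `min_count` threshold, and the toy records are all hypothetical, and no stop-word filtering or proximity analysis is attempted.

```python
from collections import Counter
import re


def phrase_counts(texts, max_words=3):
    """Count single-, double-, and triple-word phrases across a list of texts."""
    counts = Counter()
    for text in texts:
        # Assumed tokenization: lowercase alphanumeric words, hyphens kept inside words.
        words = re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", text.lower())
        for n in range(1, max_words + 1):
            for i in range(len(words) - n + 1):
                counts[" ".join(words[i:i + n])] += 1
    return counts


def candidate_terms(relevant, non_relevant, min_count=2):
    """Phrases frequent in the relevant group and absent from the non-relevant group."""
    rel = phrase_counts(relevant)
    non = phrase_counts(non_relevant)
    return sorted(p for p, c in rel.items() if c >= min_count and p not in non)


# Hypothetical toy records, for illustration only.
relevant = ["wireless lan access protocol", "ad hoc wireless lan routing"]
non_relevant = ["protein folding pathway", "lan party social study"]
print(candidate_terms(relevant, non_relevant))  # ['wireless', 'wireless lan']
```

In this toy case the single word "lan" is rejected because it also appears in the non-relevant group, while the two-word phrase "wireless lan" survives. In the study itself, candidate phrases were judged by the domain experts rather than accepted automatically.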

In  order  to  generate  an  efficient  final  query,  a  new  process  termed  Marginal  Utility 
was  applied.  At  the  start  of  the  final  iteration,  a  modified  query  Q1  was  inserted 
into  the  SCI,  and  records  were  retrieved.  Each  term  in  Q1  was  inserted  into  the 
Marginal  Utility  algorithm,  and  the  marginal  number  of  records  in  the  sample  that 
the  query  term  would  retrieve  was  computed.  Only  those  terms  that  retrieved  a 
substantial  number  of  additional  records  were  retained.  Since  (by  design)  each 
query  term  had  been  used  to  retrieve  records  from  the  SCI  as  part  of  Q1,  the
marginal  ratio  of  additional  new  records  from  the  sample  would  represent  the 
marginal  ratio  of  additional  new  records  from  the  SCI.  The  final  efficient  query 
Q2,  consisting  of  the  highest  marginal  utility  terms,  was  shown  in  the  Introduction. 

In  the  Marginal  Utility  algorithm,  terms  that  co-occur  strongly  in  records  with 
previously-selected  terms  are  essentially  duplicative  from  the  retrieval  perspective, 
and  can  be  eliminated.  Thus,  the  order  in  which  terms  are  selected  becomes 
important.  An  automated  query  term  selection  algorithm  using  Marginal  Utility  is 
being  developed  that  will  examine  all  ordering  combinations,  in  order  to  identify 
the  most  efficient  query. 
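
The marginal-utility idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `marginal_utility`, the `min_new` threshold, and the sample of record IDs are hypothetical, and the sketch evaluates a single ordering of terms rather than all ordering combinations.

```python
def marginal_utility(term_records, order, min_new=1):
    """Walk the terms in the given order; keep a term only if it adds at
    least `min_new` records not already retrieved by the terms kept so far."""
    retrieved = set()
    kept = []
    for term in order:
        new = term_records[term] - retrieved
        if len(new) >= min_new:
            kept.append((term, len(new)))
            retrieved |= new
    return kept, retrieved


# Hypothetical sample: record IDs retrieved by each query term.
sample = {
    "wireless lan": {1, 2, 3, 4},
    "wlan": {3, 4, 5},
    "ieee 802.11": {1, 2, 5},
    "ad hoc network": {6, 7},
}

# One possible ordering: largest retrieval first, ties broken alphabetically.
by_size = sorted(sample, key=lambda t: (-len(sample[t]), t))
kept, covered = marginal_utility(sample, by_size)
print(kept)  # [('wireless lan', 4), ('ieee 802.11', 1), ('ad hoc network', 2)]

# A different ordering retains a different set of terms.
alt_kept, _ = marginal_utility(
    sample, ["wlan", "wireless lan", "ieee 802.11", "ad hoc network"]
)
print([term for term, _ in alt_kept])  # ['wlan', 'wireless lan', 'ad hoc network']
```

Both orderings cover the same seven records, but they keep different terms, which is why the order of term selection matters.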

The  authors  believe  that  queries  of  these  magnitudes  and  complexities  are  required
to  provide  a  tailored  database  of  relevant  records  that  encompasses
the  broader  aspects  of  target  disciplines.  In  particular,  if  it  is  desired  to  enhance  the
transfer  of  ideas  across  disparate  disciplines,  and  thereby  stimulate  the  potential  for
innovation  and  discovery  from  complementary  literatures  [Kostoff,  1999b,  2003b,
2005a],  then  even  more  complex  queries  using  Simulated  Nucleation  may  be
required.

4.  RESULTS 

The  results  from  the  publication  and  citation  bibliometric  analyses
are  presented  in  section  4.1,  followed  by  the  results  from  the  DT  analyses
in  section  4.2.  The  SCI  bibliometric  fields  incorporated  into  the  database  included, 
for  each  paper,  the  author,  journal,  institution,  and  Keywords.  In  addition,  the  SCI 
included  references  for  each  paper. 

The  bibliometrics  section  (4.1)  has  three  components.  Some  numerical  indicators 
are  presented  for  each  bibliometric  examined.  Clustering  results,  which  portray
cohesive  groups,  are  presented  for  the  most  prolific  authors  only.  Finally,  the  highest
frequency  bibliometrics  (e.g.,  most  prolific  author,  most  prolific  country)  are 
presented  for  each  bibliometric,  and  discussed. 

The  DT  section  (4.2)  has  two  major  statistical  clustering  components:  concept  and 
document  clustering.  The  concept  clustering  groups  words  and  phrases  to  identify 
the  major  thematic  categories  of  the  database  primarily,  but  has  the  capability  to 
estimate  levels  of  emphasis  in  each  thematic  category  by  counting  phrase 
frequencies.  The  document  clustering  groups  and  counts  documents  to  identify  the 
levels  of  emphasis  in  each  thematic  category  primarily,  but  has  the  capability  to 
identify  the  major  thematic  categories  by  phrase  extraction. 

4.1  Bibliometric  Results 

The  first  group  of  metrics  presented  is  counts  of  papers  published  by  different 
entities.  These  metrics  can  be  viewed  as  output  and  productivity  measures.  They 
are  not  direct  measures  of  research  quality,  although  there  is  some  threshold 
quality  level  inferred,  since  these  papers  are  published  in  the  (typically)  high 
caliber  journals  accessed  by  the  SCI. 

These  more  recent  DT/  bibliometrics  studies  were  conducted  in  the  technical  fields
of:  1)  Near-earth  space  (NES)  [Kostoff  et  al,  1998a];  2)  Hypersonic  and  supersonic
flow  over  aerodynamic  bodies  (HSF)  [Kostoff  et  al,  1999a];  3)  Chemistry  (JACS)
[Kostoff  et  al,  1997b]  as  represented  by  the  Journal  of  the  American  Chemical
Society;  4)  Fullerenes  (FUL)  [Kostoff  et  al,  2000b];  5)  Aircraft  (AIR)  [Kostoff  et
al,  2000a];  6)  Hydrodynamic  flow  over  surfaces  (HYD);  7)  Electric  Power  Sources
(EPS)  [Kostoff  et  al,  2005b];  8)  Electrochemical  Power  Sources  (ECHEM)
[Kostoff  et  al,  2002];  9)  the  non-technical  field  of  research  impact  assessment
(RIA)  [Kostoff  et  al,  1997b];  10)  Nonlinear  Dynamics  (NONLIN)  [Kostoff  et  al,
2004a];  and  11)  Fractals  (FRACT)  [Kostoff  et  al,  2004b].  Overall  parameters  of  these
studies  from  the  SCI  database  results  and  the  current  Wireless  LAN  study  are
shown  in  Table  1.

TABLE 1 - DT STUDIES OF TOPICAL FIELDS

TOPICAL AREA                                NUMBER OF SCI ARTICLES           YEARS COVERED
1) NEAR-EARTH SPACE (NES)                   5480                             1993-MID 1996
2) HYPERSONICS (HSF)                        1284                             1993-MID 1996
3) CHEMISTRY (JACS)                         2150                             1994
4) FULLERENES (FUL)                         10515                            1991-MID 1998
5) AIRCRAFT (AIR)                           4346                             1991-MID 1998
6) HYDRODYNAMICS (HYD)                      4608                             1991-MID 1998
7) ELECTRIC POWER SOURCES (EPS)             20835                            1991-BEG 2000
8) ELECTROCHEMICAL POWER SOURCES (ECHEM)    6985                             1993-MID 2001
9) RESEARCH ASSESSMENT (RIA)                2300                             1991-BEG 1995
10) NONLINEAR DYNAMICS (NONLIN)             6118 (2001)                      1991, 2001
11) FRACTALS (FRACT)                        4454 (2001-02); 4211 (1991-93)   1991-93, 2001-02
12) WIRELESS LANS (WLANS)                   2328                             1991-MID 2002

These  studies  yielded:  1)  3.37  authors  per  paper  for  the  NES  results;  2)  2.63
authors  per  paper  for  the  HSF  results;  3)  3.79  authors  per  paper  for  the  JACS
results;  4)  3.92  authors  per  paper  for  the  FUL  results;  5)  2.09  authors  per  paper  for
the  AIR  results;  6)  2.29  authors  per  paper  for  the  HYD  results;  7)  2.90  authors  per
paper  for  the  EPS  results;  8)  3.65  authors  per  paper  for  the  ECHEM  results;  9)  1.68
authors  per  paper  for  the  RIA  results;  10)  2.68  authors  per  paper  for  the  NONLIN
results;  and  11)  2.86  authors  per  paper  for  the  FRACT  results.

In  the  present  Wireless  LANs  study,  two  types  of  computational  linguistics 
analysis  were  performed  on  the  author  data  field  in  the  database.  First,  a  frequency 
count  of  author  appearances  was  made  from  the  author  field  in  the  database,  to 
identify  the  most  prolific  authors.  Second,  a  clustering  analysis  was  performed  on 
the  list  of  author  appearances,  to  identify  tightly-knit  multi-author  groups.  The
clustering  methodology  (also  used  in  the  analysis  of  the  Abstract  free  text  field  to 
generate  technology  taxonomies)  is  described  in  Appendix  3. 

4.1.1  Prolific  Wireless  LANs  Authors 


For  the  period  1991  to  mid-2002,  there  were  2328  papers  retrieved,  4065  different 
authors,  and  6286  author  listings.  The  occurrence  of  each  author's  name  on  a  paper 
is  defined  as  an  author  listing.  While  the  average  number  of  listings  per  author  is 
about  1.5,  the  twenty  most  prolific  authors  (see  Table  2)  have  listings  more  than  an 
order  of  magnitude  greater  than  the  average.  The  number  of  papers  listed  for  each
author  is  the  number  in  the  database  of  records  extracted  from  the  SCI  using  the  query,
not  the  total  number  of  author  papers  listed  in  the  source  SCI  database. 
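
The counting described above can be illustrated with a short Python sketch. The helper `author_listing_counts` and the three toy records are hypothetical and serve only to show what an author listing is; the totals at the end are the figures reported in the text, and the two ratios are simple arithmetic on them.

```python
from collections import Counter


def author_listing_counts(records):
    """Count how many papers list each author (one listing per paper)."""
    return Counter(author for authors in records for author in set(authors))


# Hypothetical records, for illustration only.
records = [["ADACHI, F", "SAWAHASHI, M"], ["ADACHI, F"], ["ZORZI, M"]]
counts = author_listing_counts(records)
print(counts.most_common(1))  # [('ADACHI, F', 2)]

# Totals reported in the text for 1991 to mid-2002.
papers, distinct_authors, author_listings = 2328, 4065, 6286
print(round(author_listings / distinct_authors, 2))  # 1.55 listings per author
print(round(author_listings / papers, 2))            # 2.7 authors per paper
```

The first ratio is the "about 1.5" listings per author quoted above; the second is the authors-per-paper figure comparable to those given for the earlier studies.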


4.1.1.1  Author  Frequency  Results

TABLE 2 - MOST PROLIFIC AUTHORS - 1991-MID-2002
(present institution listed)

AUTHOR NAME       INSTITUTION                   COUNTRY      TECHNICAL KEYWORDS                              # OF PAPERS
ADACHI, F         Tohoku Univ                   Japan        DS-CDMA; MOBILE RADIO; RAKE COMBINING           51
SAWAHASHI, M      NTT DoCoMo Inc                Japan        DS-CDMA; MOBILE RADIO; ADAPTIVE ANTENNA ARRAY   40
FANTACCI, R       Univ Florence                 Italy        RESERVATION MULTIPLE ACCESS; PCN;               16
                                                             SATELLITE SYSTEMS
MILSTEIN, LB      Univ Calif San Diego          USA          COCHANNEL INTERFERENCE; CDMA;                   16
                                                             RANDOM SIGNATURE SEQUENCES
NAKAGAWA, M       Keio Univ                     Japan        CDMA; TIME DIVISION DUPLEX; MULTIMEDIA NETWORK  16
HIGUCHI, K        NTT DoCoMo Inc                Japan        DS-CDMA; MOBILE RADIO; W-CDMA                   15
ZORZI, M          Univ Ferrara                  Italy        ALOHA; MARKOV CHANNEL; PACKET ACCESS            15
CHO, DH           Korea Adv Inst Sci & Technol  South Korea  MULTIPLE-ACCESS; PCS; HANDOVER                  14
TANAKA, S         NTT DoCoMo Inc                Japan        ADAPTIVE ANTENNA ARRAY; DS-CDMA; MOBILE RADIO   12
UMEHIRA, M        NTT                           Japan        WIRELESS ATM; ARQ; COHERENT DETECTION           12
LEUNG, VCM        Univ British Columbia         Canada       WIRELESS ATM; PERSONAL COMMUNICATION;           11
                                                             PATH OPTIMIZATION
WU, G             Minist Posts & Telecommun     Japan        INTEGRATION OF VOICE AND DATA;                  11
                                                             MOBILE MULTIMEDIA; CHANNEL ACCESS PROTOCOL
GERLA, M          Univ Calif Los Angeles        USA          MOBILE COMPUTING; AD HOC NETWORKS;              10
                                                             HIERARCHICAL ROUTING
GIANNETTI, F      European Space Agcy           Italy        CDMA; SYNCHRONIZATION;                          10
                                                             INTERFERENCE SUPPRESSION
JEONG, DG         Hankuk Univ Foreign Studies   South Korea  CDMA/TDD; RESOURCE ALLOCATION;                  10
                                                             TRAFFIC ASYMMETRY
KOBAYASHI, T      YRP Mobile Telecommun Key     Japan        CDMA; TRANSMISSION POWER CONTROL;               10
                  Technol Res Labs Co Ltd                    MOBILE COMMUNICATION
MORINAGA, N       Osaka Univ                    Japan        QAM; ADAPTIVE MODULATION; DS/CDMA;              10
                                                             MOBILE RADIO COMMUNICATIONS
OKAWA, K          NTT DoCoMo Inc                Japan        MOBILE RADIO; W-CDMA;                           10
                                                             ITERATIVE CHANNEL ESTIMATION
SOLLENBERGER, NR  AT&T Bell Labs                USA          INTERFERENCE SUPPRESSION; EQUALIZATION;         10
                                                             TRANSMITTER DIVERSITY
BAIER, PW         Univ Kaiserslautern           Germany      ADAPTIVE ANTENNAS; TD-CDMA DOWNLINK;            9
                                                             CODE POOLING

Of  the  twenty  most  prolific  authors  listed  in  Table  2,  ten  are  from  Japan.  In  fact, 
twelve  are  from  Asia,  four  are  from  Europe  (Western),  and  four  are  from  North 
America.  Ten  are  from  universities,  seven  are  from  private  industry,  and  three  are 
from  research  institutes.  These  prolific  author  results  show  that  private  industry 
contained  a  substantial  portion  of  the  most  prolific  WLAN  research  and 
development  authors  in  the  1991  to  mid-2002  time  frame,  reflecting  the  more 
applied  research  nature  of  much  of  the  reported  work. 

Not  all  the  authors  or  papers  are  independent.  For  example,  Adachi,  Sawahashi, 
Higuchi,  and  Tanaka  published,  in  various  sub-groups,  many  papers  together. 

4.1.1.2.  Author  Frequency  Results  Update

Immediately  prior  to  publication  of  this  report,  it  was  decided  to  include  an 
updated  version  of  selected  bibliometrics  results.  For  the  author  frequency  results 
update,  the  query  shown  in  the  Introduction  (plus  the  word  MANET)  was  inserted 
in  the  SCI  search  engine,  the  time  frame  was  set  from  2004  to  mid-2005,  and  2293 
records  were  retrieved.  Using  the  newly-available  Analyze  function  on  the  SCI 
screen,  the  most  recent  2000  of  these  records  were  analyzed.  The  author  frequency 
results  for  2004-2005  are  shown  in  Table  2-U. 

TABLE 2-U - MOST PROLIFIC AUTHORS - 2004-MID-2005

AUTHOR                    # PAPERS
WONG, KL                  21
WANG, Y                   12
LEE, J                    11
YOU, YH                   11
FANG, YG                  10
HAN, KJ                   10
KIM, J                    9
GARCIA-LUNA-ACEVES, JJ    8
KIM, Y                    8
LU, SW                    8
PARK, J                   8
SONG, HK                  8
STOJMENOVIC, I            8
TSENG, YC                 8
ADACHI, F                 7
KIM, D                    7
LIN, YS                   7
PAN, CY                   7
WANG, L                   7


All  but  two  of  the  names  are  of  Asian  origin.  The  significance  of  this  will  be  seen 
in  the  update  of  most  prolific  countries. 


Figure  1  shows  the  number  of  WLAN  papers  published  as  a  function  of  time. 
Since  the  mid-90s,  there  has  been  an  approximately  linear  increase  in  papers
published,  at  about  the  rate  of  100  papers  per  year.


FIGURE  1  -  PAPERS  PUBLISHED  IN  THE  SCI  VS  TIME

[Figure:  World  Wide  Increase  of  WLANs  Related  Research.  Vertical  axis:  #  of  Papers  Published.]


4.1.1.2  Author  Clustering  Results

Two  statistical  clustering  processes  are  used  to  cluster  authors:  factor  matrix  and 
multi-link  aggregation.  A  correlation  matrix  among  the  authors  is  generated  as  the 
basis  for  the  factor  matrix  approach.  A  square  co-occurrence  matrix  of  the  240 
most  prolific  authors  (each  matrix  element  represents  the  number  of  times  each 
author  pair  is  listed  on  the  same  paper)  was  generated  as  the  basis  for  the  multi-link 
aggregation. 
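
A square co-occurrence matrix of this kind can be built as in the following Python sketch. The function name and the toy author lists are hypothetical, and placing each author's own paper count on the diagonal is an assumption (it matches the layout of Table 2A).

```python
from itertools import combinations


def cooccurrence_matrix(papers, authors):
    """Square matrix: entry [i][j] counts papers listing both authors i and j;
    the diagonal holds each author's own paper count."""
    index = {name: k for k, name in enumerate(authors)}
    matrix = [[0] * len(authors) for _ in authors]
    for author_list in papers:
        # Ignore authors outside the selected list (e.g. outside the most prolific).
        present = sorted({index[a] for a in author_list if a in index})
        for i in present:
            matrix[i][i] += 1
        for i, j in combinations(present, 2):
            matrix[i][j] += 1
            matrix[j][i] += 1
    return matrix


# Hypothetical author lists, for illustration only.
papers = [["A", "B"], ["A", "B", "C"], ["C"], ["A", "D"]]
authors = ["A", "B", "C"]
print(cooccurrence_matrix(papers, authors))  # [[3, 2, 1], [2, 2, 1], [1, 1, 2]]
```

Author "D" is dropped because it is not in the selected author list, just as authors outside the 240 most prolific would be excluded from the matrix used here.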

4.1.1.2.1  Factor  Matrix

First,  a  list  of  authors  and  their  co-occurrences  in  the  WLAN  database  was 
generated.  The  list  and  co-occurrences  were  used  to  create  a  correlation  matrix, 
using  the  WINSTAT  software  package.  WINSTAT  then  used  the  correlation 
matrix  to  create  a  factor  matrix.  Each  matrix  element  contains  the  factor  loading,  a 
measure  of  the  importance  of  each  author  to  each  factor.  The  number  of  factors 
was  unconstrained,  but  the  eigenvalues  had  a  floor  of  unity.  Practically,  this  means 
that  each  factor  provides  some  additional  useful  information. 
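
The eigenvalue floor can be illustrated with a small NumPy sketch. This is not the WINSTAT procedure, whose internals are not described here; it assumes that the correlation matrix is computed from an author-by-paper incidence matrix, that factors are extracted by an eigendecomposition of that correlation matrix, and that loadings are eigenvectors scaled by the square root of their eigenvalues. The incidence data are hypothetical.

```python
import numpy as np

# Hypothetical incidence matrix: 6 authors (rows) by 18 papers (columns).
# Three pairs of authors each co-author three papers; papers 9-17 list other authors.
incidence = np.zeros((6, 18))
incidence[0:2, 0:3] = 1
incidence[2:4, 3:6] = 1
incidence[4:6, 6:9] = 1

corr = np.corrcoef(incidence)                     # author-by-author correlations
eigenvalues, eigenvectors = np.linalg.eigh(corr)  # eigenvalues in ascending order
order = np.argsort(eigenvalues)[::-1]             # strongest factor first
eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]

keep = eigenvalues >= 1.0                         # eigenvalue floor of unity
loadings = eigenvectors[:, keep] * np.sqrt(eigenvalues[keep])

print(int(keep.sum()))   # 3 factors retained
print(loadings.shape)    # (6, 3): one loading per author per retained factor
```

With three tightly knit author pairs, three factors pass the floor; the remaining eigenvalues are near zero and are discarded.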

Appendix  4  contains  the  factor  matrix  for  the  240  most  prolific  authors.  Factor 
loading  values  above  a  threshold  were  shaded.  Each  column  represents  one  factor, 
and  the  dark  vertical  bands  in  each  column  represent  the  essential  contributors  to 
each  factor.  The  most  cohesive  factors  start  from  the  left  column,  and  proceed  to
decrease  in  strength  monotonically  to  the  right.  Thus,  the  strongest  factor  is  the
first,  consisting  of  the  authors  ranging  from  Ju  MC  to  Cho  JW.

The  factors  are  not  fully  orthogonal,  so  some  authors  can  be  connected  strongly  to
more  than  one  factor,  and  the  authors  that  represent  the  core  of  a  factor  are  not
necessarily  listed  contiguously.

4.1.1.2.2  Multi-Link  Aggregation

Multi-link  aggregation  clustering  of  WLAN  authors  was  performed  to  obtain  more 
details  on  how  strongly  linked  collaborating  authors  were,  as  well  as  to  obtain  a 
slightly  different  perspective  on  the  groupings.  The  clusters  of  related  authors  were 
generated  using  the  multi-link  aggregation  method  of  the  WINSTAT  software 
package  described  in  Appendix  3. 

Appendix  5  contains  the  full  dendrogram  for  the  240  most  prolific  authors.  The 
ordinate  is  the  ‘distance’,  a  measure  of  the  coupling  strength  between  authors,  or 
groups  of  authors.  Smaller  ‘distance’  means  stronger  coupling.  The  abscissa  is
authors,  and  positioning  of  an  author,  or  group  of  authors,  along  the  axis  also 
reflects  the  relationships  among  authors  or  among  groups. 
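
How a dendrogram of this kind is assembled can be sketched with a small agglomerative clustering routine in Python. The sketch assumes an average-linkage merging rule, which may differ from the multi-link aggregation method implemented in WINSTAT, and the distance matrix is hypothetical.

```python
def average_link_clusters(names, dist):
    """Repeatedly merge the two closest clusters (average linkage) and
    return the merge history as (members_a, members_b, distance) tuples."""
    clusters = [(name,) for name in names]
    index = {name: k for k, name in enumerate(names)}

    def avg(a, b):
        total = sum(dist[index[x]][index[y]] for x in a for y in b)
        return total / (len(a) * len(b))

    history = []
    while len(clusters) > 1:
        pairs = [(avg(a, b), i, j)
                 for i, a in enumerate(clusters)
                 for j, b in enumerate(clusters) if i < j]
        d, i, j = min(pairs)
        history.append((clusters[i], clusters[j], round(d, 3)))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return history


# Hypothetical distances among four authors (smaller means stronger coupling).
names = ["A", "B", "C", "D"]
dist = [
    [0.0, 0.1, 0.4, 0.9],
    [0.1, 0.0, 0.5, 0.8],
    [0.4, 0.5, 0.0, 0.7],
    [0.9, 0.8, 0.7, 0.0],
]
history = average_link_clusters(names, dist)
for left, right, d in history:
    print(left, right, d)
```

Each merge corresponds to one junction in the dendrogram, drawn at the height of the merge distance: here A and B join first, C joins that pair next, and D joins last.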

The  authors  from  Factor  1  of  the  factor  matrix  are  shown  to  constitute  a  cluster 
(close  to  the  bottom  of  the  dendrogram).  In  this  portrayal,  however,  the  detailed 
structure  within  the  Factor  1  cluster  is  evident  from  the  dendrogram’s  tree-like 
structure.  Ju,  You,  and  Park  form  a  tightly  knit  unit  of  collaborating  authors;  that 
unit  combines  with  Paik  to  form  a  less  tightly  knit  unit,  and  then  combines  in  turn 
with  Cho. 

Table  2A  is  a  co-occurrence  matrix  of  the  authors  contained  in  Factor  1. 

TABLE 2A - FACTOR 1 AUTHORS CO-OCCURRENCE MATRIX

# OF PAPERS   AUTHOR NAMES   CHO-JW   JU-MC   PAIK-JH   PARK-CH   YOU-YH
7             CHO-JW         7        6       5         5         6
7             JU-MC          6        7       6         6         7
6             PAIK-JH        5        6       6         5         6
6             PARK-CH        5        6       5         6         6
7             YOU-YH         6        7       6         6         7

The  closeness  of  the  authors  depicted  schematically  on  the  dendrogram  is 
confirmed  by  the  actual  numbers  of  papers.  All  of  Ju’s  and  You’s  papers  were
published  with  each  other,  and  all  but  one  were  published  with  Park.  Paik  had  six
papers,  all  of  which  were  published  with  Ju  and  You.  Hence,  Paik  is  the  next 
closest  group  member  in  the  dendrogram,  followed  by  Cho  who  also  published 
very  closely  with  all  the  other  authors. 
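
These statements can be checked directly against the Table 2A values. In the Python sketch below, the matrix is transcribed from Table 2A; the helper `shared_fraction` is a hypothetical convenience, not a measure used in the study.

```python
authors = ["CHO-JW", "JU-MC", "PAIK-JH", "PARK-CH", "YOU-YH"]
# Co-occurrence counts from Table 2A; the diagonal is each author's paper count.
cooc = [
    [7, 6, 5, 5, 6],
    [6, 7, 6, 6, 7],
    [5, 6, 6, 5, 6],
    [5, 6, 5, 6, 6],
    [6, 7, 6, 6, 7],
]
idx = {name: k for k, name in enumerate(authors)}


def shared_fraction(a, b):
    """Share of author a's papers that also list author b."""
    return cooc[idx[a]][idx[b]] / cooc[idx[a]][idx[a]]


print(shared_fraction("JU-MC", "YOU-YH"))               # 1.0
print(shared_fraction("PAIK-JH", "JU-MC"))              # 1.0
print(round(shared_fraction("JU-MC", "PARK-CH"), 2))    # 0.86
print(round(shared_fraction("CHO-JW", "PAIK-JH"), 2))   # 0.71
```

A fraction of 1.0 means every paper by the first author also lists the second, as is the case for Ju with You and for Paik with Ju.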

4.1.2  Journals  Containing  Most  Wireless  LAN  Papers 


There  were  285  different  journals  represented,  with  an  average  of  8.17  papers  per 
journal.  The  journals  containing  the  most  Wireless  LAN  papers  (see  Table  3)  had 
more  than  an  order  of  magnitude  more  papers  than  the  average. 

TABLE 3 - JOURNALS CONTAINING MOST PAPERS - 1991-MID-2002

JOURNAL NAMES                                       # OF PAPERS
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS    237
IEICE TRANSACTIONS ON COMMUNICATIONS                183
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY           159
ELECTRONICS LETTERS                                 115
IEEE TRANSACTIONS ON COMMUNICATIONS                 105
IEEE COMMUNICATIONS MAGAZINE                        94
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS   73
  COMMUNICATIONS AND COMPUTER SCIENCES
MICROWAVES & RF                                     71
IEEE PERSONAL COMMUNICATIONS                        61
WIRELESS NETWORKS                                   60
COMPUTER COMMUNICATIONS                             59
MOBILE NETWORKS & APPLICATIONS                      41
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF      39
  COMPUTER AND TELECOMMUNICATIONS NETWORKING
IEE PROCEEDINGS-COMMUNICATIONS                      33
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS           31
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART 1-     28
  COMMUNICATIONS
WIRELESS PERSONAL COMMUNICATIONS                    28
IEEE COMMUNICATIONS LETTERS                         27
IEEE NETWORK                                        26
IEEE JOURNAL OF SOLID-STATE CIRCUITS                26

The  majority  of  the  journals  are  focused  on  communications,  with  the  remainder
divided  between  networking  and  electronics.  Bradford’s  Law  allows  journals  to  be
divided  into  groups,  with  each  group  containing  roughly  similar  numbers  of
publications,  and  the  number  of  journals  in  each  group  increasing  by  a  roughly
constant  factor  from  one  group  to  the  next.
There  appear  to  be  two  primary  groups  at  the  top  layer.  The  IEEE  Journal  on
Selected  Areas  in  Communications  contains  the  most  articles  by  far  and,  along  with
IEICE  Transactions  on  Communications  and  IEEE  Transactions
on  Vehicular  Technology,  constitutes  the  first  group.  The  next  group  consists  of
Electronics  Letters,  IEEE  Transactions  on  Communications,  IEEE 
Communications  Magazine,  IEICE  Transactions  on  Fundamentals  of  Electronics 
Communications  and  Computer  Sciences,  Microwaves  &  RF,  IEEE  Personal 
Communications,  and  Wireless  Networks. 
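
The two groups named above can be checked against the paper counts in Table 3 with a few lines of Python. The counts are transcribed from Table 3 and the totals from the text; the variable names are illustrative only.

```python
# Paper counts of the ten most prolific journals, in Table 3 order.
counts = [237, 183, 159, 115, 105, 94, 73, 71, 61, 60]

first_group = counts[:3]    # IEEE JSAC, IEICE Trans. Commun., IEEE Trans. Veh. Technol.
second_group = counts[3:]   # Electronics Letters through Wireless Networks

print(sum(first_group), sum(second_group))               # 579 579
print(round(len(second_group) / len(first_group), 2))    # 2.33

# Average papers per journal over all 285 journals and 2328 papers.
print(round(2328 / 285, 2))                              # 8.17
```

The two groups contain the same number of papers (579 each), while the second group needs seven journals to match the output of the first three, which is the pattern Bradford's Law describes.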


While  IEEE  Transactions  on  Vehicular  Technology  may  seem  out  of  place,  this 
journal  publishes  many  papers  dealing  with  electrical  and  electronics  technology  in 
vehicles  and  vehicular  systems.  The  areas  of  communications,  transportation 
systems,  and  vehicular  electronics  define  its  scope. 


4.1.2.1.  Journals  Containing  Most  Wireless  LAN  Papers  Update

The  2004-mid-2005  papers  were  retrieved  from  the  SCI,  and  the  journals
containing  the  most  Wireless  LANs  papers  were  identified.  The  results  are  shown
in  Table  3-U.

TABLE 3-U - JOURNALS CONTAINING MOST PAPERS - 2004-MID-2005

JOURNAL                                             # PAPERS
IEICE TRANSACTIONS ON COMMUNICATIONS                121
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS    83
COMPUTER COMMUNICATIONS                             59
WIRELESS COMMUNICATIONS & MOBILE COMPUTING          53
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS        50
MICROWAVE AND OPTICAL TECHNOLOGY LETTERS            47
IEEE TRANSACTIONS ON MOBILE COMPUTING               44
ELECTRONICS LETTERS                                 37
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS           37
IEEE WIRELESS COMMUNICATIONS                        37
NETWORKING - ICN 2005, PT 2                         37
IEEE COMMUNICATIONS LETTERS                         35
IEEE JOURNAL OF SOLID-STATE CIRCUITS                35
WIRELESS NETWORKS                                   35
IEEE COMMUNICATIONS MAGAZINE                        32
COMPUTER NETWORKS-                                  31
NETWORKING 2004                                     30
MOBILE NETWORKS & APPLICATIONS                      29
TELECOMMUNICATIONS AND NETWORKING - ICT 2004        29


Of  the  21  journals  pre-2003,  and  the  19  journals  post-2003  listed,  ten  were  in 
common.  Of  those  not  shared,  many  of  the  recent  papers  appear  in  journals  whose 
titles  suggest  a  wireless  focus  and  a  mobile  networking  focus.  Because  of  the
technology  specificity,  these  more  recent  papers  have  an  even  stronger  application
focus  within  what  was  already  perceived  as  a  quite  applied  discipline.


4.1.3  Institutions  Producing  Most  Wireless  LAN  Papers 


A  similar  process  was  used  to  develop  a  frequency  count  of  institutional  address 
appearances.  It  should  be  noted  that  many  different  organizational  components 
may  be  included  under  the  single  organizational  heading  (e.g.,  Univ  Calif  Los 
Angeles  could  include  the  Department  of  Computer  Science,  Department  of 
Electrical  Engineering,  etc.).  Identifying  the  higher  level  institutions  is 
instrumental  for  these  DT  studies.  Once  they  have  been  identified  through 
bibliometric  analysis,  subsequent  measures  may  be  taken  (if  desired)  to  identify 
particular  departments  within  an  institution. 

TABLE 4 - PROLIFIC INSTITUTIONS - 1991-MID-2002

INSTITUTION                     COUNTRY       # OF PAPERS
NTT                             JAPAN         136
AT&T BELL LABS                  USA           93
UNIV CALIF LOS ANGELES          USA           41
UNIV CALIF SAN DIEGO            USA           35
NEC CORP                        USA           35
NATL CHIAO TUNG UNIV            TAIWAN        33
KOREA ADV INST SCI & TECHNOL    SOUTH KOREA   29
NATL UNIV SINGAPORE             SINGAPORE     27
KEIO UNIV                       JAPAN         27
STANFORD UNIV                   USA           25
UNIV CALIF BERKELEY             USA           25
GEORGIA INST TECHNOL            USA           25
UNIV TEXAS                      USA           24
RUTGERS STATE UNIV              USA           24
IBM                             USA           23
LUCENT TECHNOL                  USA           22
SIEMENS AG                      GERMANY       22
MOTOROLA INC                    USA           21
ETRI                            SOUTH KOREA   20
SEOUL NATL UNIV                 SOUTH KOREA   20

Of  the  twenty  most  prolific  institutions  listed  in  Table  4,  twelve  are  from  the  USA,
seven  are  from  Asia,  and  one  is  from  Western  Europe.  Twelve  are  universities,
seven  are  privately  owned  businesses,  and  the  remaining  institution  (ETRI)  is  a  research
institute.  The  sector  distribution  reflects  that  of  the  most  prolific  authors,  and
confirms  the  substantial  contribution  of  the  private  sector.


4.1.3.1.  Institutions  Producing  Most  Wireless  LAN  Papers  Update


The  2004-mid-2005  papers  were  retrieved  from  the  SCI,  and  the  institutions
producing  the  most  Wireless  LANs  papers  were  identified.  The  results  are  shown
in  Table  4-U.

TABLE 4-U - INSTITUTIONS PRODUCING MOST PAPERS - 2004-MID-2005

INSTITUTION                     # PAPERS
NATL SUN YATSEN UNIV            36
KOREA UNIV                      33
GEORGIA INST TECHNOL            30
NATL CHIAO TUNG UNIV            28
UNIV ILLINOIS                   27
TSING HUA UNIV                  26
NANYANG TECHNOL UNIV            25
KYUNGPOOK NATL UNIV             21
UNIV TEXAS                      21
UNIV OTTAWA                     20
UNIV CALIF LOS ANGELES          19
UNIV FLORIDA                    19
NATL TAIWAN UNIV                18
SEJONG UNIV                     18
ARIZONA STATE UNIV              17
INDIAN INST TECHNOL             17
UNIV WATERLOO                   17
HONG KONG UNIV SCI & TECHNOL    16
UNIV MARYLAND                   16
YONSEI UNIV                     16

There  has  been  a  shift  in  the  most  prolific  institutions  from  mainly  USA  to  mainly  Asian.
Also,  none  of  the  industrial  organizations  (that  were  so  prominent  in  Table  4)
appear  in  the  latest  results.  This  absence  of  industry  can  be  seen  more  dramatically
by  displaying  the  results  from  the  period  1991-1995  only.  Table  4-U-U  presents
these  results  for  the  top  26  institutions.  Over  25%  are  industrial  organizations.

TABLE 4-U-U - INSTITUTIONS PRODUCING MOST PAPERS - 1991-1995

INSTITUTION                             # PAPERS
KERNFORSCHUNGSZENTRUM KARLSRUHE GMBH    21
NIPPON TELEGRAPH & TEL PUBL CORP        16
AT&T BELL LABS                          11
COMMISS EUROPEAN COMMUNITIES            11
NATL UNIV SINGAPORE                     10
UNIV ANCONA                             9
UNIV CALIF SAN DIEGO                    7
UNIV KAISERSLAUTERN                     7
BELLCORE                                6
CARLETON UNIV                           6
DELFT UNIV TECHNOL                      6
IBM CORP                                6
UNIV ROMA TOR VERGATA                   6
ENEA CASACCIA                           5
KOREA ADV INST SCI & TECHNOL            5
UNIV CALIF DAVIS                        5
KFA JULICH GMBH                         4
MOTOROLA INC                            4
POLITECN TURIN                          4
RUSSIAN ACAD SCI                        4
SIEMENS AG                              4
UNIV ARIZONA                            4
UNIV CALIF BERKELEY                     4
UNIV VICTORIA                           4
USN                                     4
ENEA                                    3


4.1.4  Countries  Producing  Most  Wireless  LAN  Papers 

There  are  51  different  countries  listed  in  the  results.  The  country  bibliometric 
results  are  summarized  in  Table  5.  The  dominance  of  a  handful  of  countries  is 
clearly  evident. 


TABLE 5 - PROLIFIC COUNTRIES - 1991-MID-2002

COUNTRY           # OF PAPERS
USA               754
JAPAN             361
SOUTH KOREA       146
ITALY             129
GERMANY           124
TAIWAN            118
ENGLAND           114
CANADA            112
SINGAPORE         59
AUSTRALIA         50
PEOPLES R CHINA   49
NETHERLANDS       48
GREECE            42
FRANCE            37
FINLAND           36
SWEDEN            28
SPAIN             27
INDIA             23
HONG KONG         18
SWITZERLAND       18

There appear to be three dominant groups in the twenty most prolific countries.

The USA is in a group of its own, with more than twice as many papers as the
next most prolific country. Japan constitutes the second most dominant group,
with substantially more papers than the third most prolific country. The sum of
papers from the USA and Japan almost equals the cumulative sum of the next
eighteen most prolific countries. South Korea, Italy, Germany, Taiwan, England
and Canada make up the third most prolific group. Of the eleven most prolific
countries, five are Asian, confirming the prolific author and institution results.

With  the  exception  of  perhaps  China,  the  developing  nations  are  playing  almost  no 
role  in  the  literature  contribution  of  Wireless  LANs. 
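
The group comparisons above can be checked directly from the Table 5 counts.
The following is a minimal sketch, not part of the original analysis; the
function name group_summary is hypothetical, and the counts are copied from
Table 5.

```python
# Paper counts copied from Table 5 (1991 to mid-2002).
papers = {
    "USA": 754, "JAPAN": 361, "SOUTH KOREA": 146, "ITALY": 129,
    "GERMANY": 124, "TAIWAN": 118, "ENGLAND": 114, "CANADA": 112,
    "SINGAPORE": 59, "AUSTRALIA": 50, "PEOPLES R CHINA": 49,
    "NETHERLANDS": 48, "GREECE": 42, "FRANCE": 37, "FINLAND": 36,
    "SWEDEN": 28, "SPAIN": 27, "INDIA": 23, "HONG KONG": 18,
    "SWITZERLAND": 18,
}


def group_summary(counts):
    """Return (top-two total, next-eighteen total, first/second ratio)."""
    ranked = sorted(counts.values(), reverse=True)
    top_two = sum(ranked[:2])
    next_eighteen = sum(ranked[2:20])
    first_to_second = ranked[0] / ranked[1]
    return top_two, next_eighteen, first_to_second


top_two, next_eighteen, usa_to_japan = group_summary(papers)
print(top_two, next_eighteen, round(usa_to_japan, 2))
```

With these counts, the USA and Japan together account for 1115 papers against
1178 for the next eighteen countries, and the USA has slightly more than twice
Japan's output.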

4.1.4.1 Country Publications With Respect To Time

There has been a dramatic increase in the number of Wireless LAN publications
that different countries have produced. As Figure 2 illustrates, the most
notable increase has been within the United States. Japan, although its
increase is smaller than that of the United States, still has had an increase
in publications that well surpasses that of any of the other countries.

FIGURE 2 - COUNTRY PUBLICATIONS VS TIME

[Line chart: Country Publications per Year (1991 - 2001). Vertical axis:
number of publications. Series: USA, Japan, South Korea, Italy, Germany,
Taiwan, England, Canada.]



Appendix 6 contains a co-occurrence matrix of the top 15 countries. In terms of
absolute numbers of co-authored papers, the USA's major partners are Italy,
Japan, Canada, and South Korea. Most of the top 15 countries publishing
Wireless LAN papers did not have a substantial number of inter-country papers.
Where inter-country publishing did occur, it typically was with countries in
similar geographical regions. The USA was perhaps the exception, co-publishing
with many other countries from various geographical regions.
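
A country co-occurrence matrix of the kind referenced above can be built by
counting, for each pair of countries, the papers whose author addresses list
both. The sketch below is illustrative only: the function name
country_cooccurrence and the sample records are hypothetical, and it assumes
each record has already been reduced to a list of author-address countries.

```python
from collections import Counter
from itertools import combinations


def country_cooccurrence(papers):
    """Count, for each unordered country pair, the papers listing both."""
    pair_counts = Counter()
    for countries in papers:
        # A sorted set ensures each pair is counted once per paper,
        # even if a country appears in several author addresses.
        for a, b in combinations(sorted(set(countries)), 2):
            pair_counts[(a, b)] += 1
    return pair_counts


# Hypothetical records, one list of address countries per paper.
sample = [
    ["USA", "ITALY"],
    ["USA", "JAPAN", "USA"],
    ["JAPAN"],
    ["ITALY", "USA"],
]
counts = country_cooccurrence(sample)
print(counts[("ITALY", "USA")], counts[("JAPAN", "USA")])
```

Single-country papers contribute nothing to the off-diagonal entries, which is
why countries with little inter-country publishing show mostly empty rows.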

4.1.4.2 Countries Producing Most Wireless LAN Papers Update

The 2004-mid-2005 papers were retrieved from the SCI, and the countries
producing the most Wireless LANs papers were identified. The results are shown
in Table 5-U.




TABLE  5-U  -  COUNTRIES  PRODUCING  MOST  PAPERS  -  2004-MID-2005 


COUNTRY             #PAPERS

USA                 572
SOUTH KOREA         246
JAPAN               166
TAIWAN              165
PEOPLES R CHINA     140
CANADA              121
GERMANY             96
ITALY               92
FRANCE              83
GREECE              66
ENGLAND             57
SPAIN               52
INDIA               50
SINGAPORE           49
SWITZERLAND         35
NETHERLANDS         31
FINLAND             29
AUSTRALIA           26
SWEDEN              20
TURKEY              19
BELGIUM             17
BRAZIL              16


For  dramatic  comparison,  the  countries  producing  the  most  papers  from  1991-1995 
were  tabulated.  The  results  are  shown  in  Table  5-U-U. 

TABLE  5-U-U  -  COUNTRIES  PRODUCING  MOST  PAPERS  -  1991-1995 


COUNTRY             #PAPERS

USA                 103
GERMANY             50
JAPAN               45
ITALY               38
CANADA              25
ENGLAND             14
NETHERLANDS         12
SINGAPORE           10
FRANCE              9
SOUTH KOREA         7
AUSTRALIA           6
GREECE              4
RUSSIA              4
SWITZERLAND         4
INDIA               3
SWEDEN              3
TAIWAN              3
ARGENTINA           2
BYELARUS            2
HONG KONG           2
ISRAEL              2
NEW ZEALAND         2
PORTUGAL            2
SCOTLAND            2
SPAIN               2


There  has  been  a  major  shift  in  country  emphasis  in  the  past  decade.  Except  for 
the  USA,  the  Western  nations’  leadership  has  been  replaced  by  that  of  the  Asian 
nations.  Especially  large  advances  have  been  made  by  South  Korea  (7%  of  USA 
paper  production  a  decade  ago  to  43%  of  USA  production  presently),  Taiwan  (3% 
of  USA  to  29%  of  USA),  and  Peoples  Republic  of  China  (1%  of  USA  to  25%  of 
USA). 
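
The relative-production figures quoted above follow from the counts in
Tables 5-U-U and 5-U. A minimal sketch of the arithmetic is given here; the
helper name percent_of_usa is hypothetical. The 1991-1995 count for the
Peoples Republic of China does not appear in Table 5-U-U, so only its recent
share is computed.

```python
# Counts copied from Table 5-U-U (1991-1995) and Table 5-U (2004-mid-2005).
early = {"USA": 103, "SOUTH KOREA": 7, "TAIWAN": 3}
recent = {"USA": 572, "SOUTH KOREA": 246, "TAIWAN": 165,
          "PEOPLES R CHINA": 140}


def percent_of_usa(counts, country):
    """Express a country's paper count as a percentage of the USA count."""
    return round(100 * counts[country] / counts["USA"], 1)


for country in ("SOUTH KOREA", "TAIWAN"):
    print(country, percent_of_usa(early, country),
          percent_of_usa(recent, country))
print("PEOPLES R CHINA", percent_of_usa(recent, "PEOPLES R CHINA"))
```

The computed values are 6.8% and 43.0% for South Korea, 2.9% and 28.8% for
Taiwan, and 24.5% for the recent Peoples Republic of China share, consistent
with the rounded figures in the text.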

4.1.5  Citation  Statistics  on  Authors,  Papers,  and  Journals 


The second group of metrics presented is counts of citations to papers
published by different entities. While citations are ordinarily used as impact
or quality metrics [Garfield, 1985], much caution needs to be exercised in
interpreting their frequency counts, since there are numerous reasons why
authors cite or do not cite particular papers [Kostoff, 1988b; MacRoberts,
1996].


The  citations  in  all  the  retrieved  SCI  papers  (from  1991  to  mid-2002)  were 
aggregated.  The  authors,  specific  papers,  years,  journals,  and  countries  cited  most 
frequently  were  identified,  and  were  presented  in  order  of  decreasing  frequency.  A 
small  percentage  of  any  of  these  categories  received  large  numbers  of  citations. 
From the citation year results, the most recent papers tended to be the most
highly cited, which is characteristic of a rapidly evolving field of research.
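
The aggregation step described above can be sketched as a set of frequency
tallies over the cited-reference strings of the retrieved records. This is an
illustrative sketch only: the function name tally_citations is hypothetical,
and it assumes each cited reference is a comma-separated string of the form
"first author, year, source, volume, page", as in the Table 7 listings.

```python
from collections import Counter


def tally_citations(reference_lists):
    """Tally cited first authors, papers, years, and sources."""
    authors, papers, years, sources = Counter(), Counter(), Counter(), Counter()
    for refs in reference_lists:
        # Count each reference at most once per citing paper.
        for ref in set(refs):
            fields = [f.strip() for f in ref.split(",")]
            if len(fields) < 3:
                continue  # skip entries too short to parse
            papers[ref] += 1
            authors[fields[0]] += 1
            years[fields[1]] += 1
            sources[fields[2]] += 1
    return authors, papers, years, sources


# Hypothetical reference lists for two citing papers.
refs = [
    ["GILHOUSEN KS, 1991, IEEE T VEH TECHNOL, V40, P303",
     "PROAKIS JG, 1989, DIGITAL COMMUNICATIO"],
    ["GILHOUSEN KS, 1991, IEEE T VEH TECHNOL, V40, P303",
     "LEE WCY, 1991, IEEE T VEH TECHNOL, V40, P291"],
]
authors, papers, years, sources = tally_citations(refs)
print(authors.most_common(1), sources.most_common(1))
```

Because only the first author is present in each reference string, any tally
built this way credits first authors only, which matters for the
interpretation of the most-cited author results.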


4.1.5.1 Most Cited First Authors


The  most  highly  cited  first  authors  are  listed  in  Table  6. 

TABLE  6  -  MOST  CITED  FIRST  AUTHORS 
(cited  by  other  papers  in  this  database  only) 


AUTHOR            INSTITUTION                           COUNTRY   TIMES CITED

VITERBI AJ        Qualcomm Inc                          USA       276
RAYCHAUDHURI D    Rutgers State Univ                    USA       267
PROAKIS JG        Northeastern Univ                     USA       235
PERKINS C         Nokia Res Ctr                         USA       233
GILHOUSEN KS      Qualcomm Inc                          USA       226
LEE WCY           LinkAir Commun Inc                    USA       191
GOODMAN DJ        Polytech Univ                         USA       189
KLEINROCK L       Univ Calif Los Angeles                USA       150
ADACHI F          Tohoku Univ                           Japan     146
PURSLEY MB        Clemson Univ                          USA       140
TOBAGI FA         Stanford Univ                         USA       137
ZORZI M           Univ Ferrara                          Italy     136
RAPPAPORT TS      Virginia Polytech Inst & State Univ   USA       134
JAKES WC          Bell Tel Labs Inc                     USA       130
VERDU S           Princeton Univ                        USA       118
PAHLAVAN K        Worcester Polytech Inst               USA       111
KOHNO R           Futaba Corp                           Japan     103
TOH CK            Georgia Inst Technol                  USA       99
ACAMPORA AS       Univ Calif San Diego                  USA       94
JOHNSON DB        Carnegie Mellon Univ                  USA       94


Of  the  twenty  most  cited  authors,  seventeen  are  from  North  America  (USA),  two 
are  from  Asia  (Japan),  and  one  is  from  Europe  (Italy).  This  is  a  far  different 
distribution  from  the  most  prolific  authors,  where  twelve  were  from  Asia  (Far 
East).  There  are  a  number  of  potential  reasons  for  this  difference,  including 
difference  in  quality  and  late  entry  into  the  Wireless  LANs  research  discipline.  In 
another  three  or  four  years,  when  the  papers  from  present-day  authors  have 
accumulated  sufficient  citations,  firmer  conclusions  about  quality  can  be  drawn. 


The  lists  of  twenty  most  prolific  authors  and  twenty  most  highly  cited  first  authors 
had  only  two  names  in  common  (ADACHI,  ZORZI).  This  phenomenon  of 
minimal  intersection  has  been  observed  in  all  other  text  mining  studies  performed 
by  the  first  author  of  the  present  report.  The  time  frame  of  interest  for  most  prolific 
authors  is  present  time,  whereas  the  time  frame  of  interest  for  most  cited  can  span 
many  decades.  Researchers  who  may  very  well  have  been  prolific  when  their  most 
citable  work  was  done  may  no  longer  be  prolific.  Finally,  many  prolific  authors, 
especially  at  academic  institutions,  tend  not  to  be  first  authors  in  most  of  their 
publications.  Therefore,  they  tend  to  be  severely  under-represented  in  the  SCI 
citation  downloads,  which  contain  first  author  only. 


Twelve of the top twenty cited authors’ institutions are universities, and five
are private companies. This differs somewhat from the most prolific authors,
for whom the distribution between universities and privately owned businesses
is ten to seven, respectively.


The  citation  data  for  authors  and  journals  represents  citations  generated  only  by  the 
specific  records  extracted  from  the  SCI  database  for  this  Wireless  LANs  study.  It 
does  not  represent  all  the  citations  received  by  the  references  in  those  records; 
these  references  (in  the  database  records)  could  have  been  cited  additionally  by 
papers  in  other  technical  disciplines. 

4.1.5.2  Most  Cited  Papers 

The  most  highly  cited  papers  are  listed  in  Table  7. 

TABLE  7  -  MOST  CITED  PAPERS 
(total  citations  from  retrieved  papers) 


PUBLICATION (PAPER / BOOK)                                TIMES CITED

GILHOUSEN KS, 1991, IEEE T VEH TECHNOL, V40, P303         179
  (Interference suppression features of CDMA, and the resulting
  increase in capacity over competing multiple access techniques)

PROAKIS JG, 1989, DIGITAL COMMUNICATIO                    93
  (Basic principles that underlie the analysis and design of digital
  communication systems)

VITERBI AJ, 1995, CDMA PRINCIPLES SPRE                    89
  (CDMA and the commercial implementation of spread spectrum
  technology that underpins the surge in personal wireless
  communication)

JAKES WC, 1974, MICROWAVE MOBILE COM                      85
  (Basic microwave theory, overview of cellular systems, and useful
  techniques for effective mobile microwave systems development)

PROAKIS JG, 1995, DIGITAL COMMUNICATIO                    83
  (Basic principles that underlie the analysis and design of digital
  communication systems)

RAYCHAUDHURI D, 1994, IEEE J SEL AREA COMM, V12, P1401    79
  (ATM-based transport architecture for multi-services wireless
  personal communication networks)

GOODMAN DJ, 1989, IEEE T COMMUN, V37, P885                73
  (Packet reservation multiple access for local wireless
  communications)

LEE WCY, 1991, IEEE T VEH TECHNOL, V40, P291              68
  (Overview of cellular CDMA)

PURSLEY MB, 1977, IEEE T COMMUN, V25, P795                67
  (Performance evaluation for phase-coded spread-spectrum
  multiple-access communication)

BAIER A, 1994, IEEE J SEL AREA COMM, V12, P733            51
  (CDMA-based 3rd-generation mobile radio system)

RAPPAPORT TS, 1996, WIRELESS COMMUNICATI                  49
  (Wireless communications technology and system design. Covers
  the basic issues affecting all wireless networks and reviews new
  technological developments, with particular attention to 3G
  systems and wireless LANs)

ACAMPORA AS, 1994, IEEE J SEL AREA COMM, V12, P1365       49
  (Architecture and methodology for mobile-executed handoff in
  cellular ATM networks)

VITERBI AM, 1993, IEEE J SEL AREA COMM, V11, P892         46
  (Erlang capacity of a power controlled CDMA system)

PICKHOLTZ RL, 1991, IEEE T VEH TECHNOL, V40, P313         43
  (Spread spectrum for mobile communications)

RAYCHAUDHURI D, 1997, IEEE J SEL AREA COMM, V15, P83      41
  (Prototype wireless ATM system for multimedia personal
  communication)

BERTSEKAS D, 1992, DATA NETWORKS                          40
  (An exploration of data networks, evolving integrated networks,
  and the various analysis and design tools)

KLEINROCK L, 1975, IEEE T COMMUN, V23, P1400              39
  (Packet switching in radio channels. CSMA modes and their
  throughput-delay characteristics)

ADACHI F, 1998, IEEE COMMUN MAG, V36, P56                 39
  (Wideband DS-CDMA for next-generation mobile communications
  systems)

LIN S, 1983, ERROR CONTROL CODING                         38
  (The fundamentals of coding and the applications of codes to the
  design of real error control systems)

NANDA S, 1991, IEEE T VEH TECHNOL, V40, P584              38
  (Performance of PRMA - a packet voice protocol for cellular
  systems)


The theme of each paper is shown in parentheses on the lines after the paper
listing. The order of paper listings is by number of citations by the Wireless
LAN papers in the extracted database.

The focal points of the majority of the papers shown here are CDMA
(Code-Division Multiple Access) and other protocols within Wireless LANs. The
papers also address a number of other topics, including data transmission and
throughput in ATM (Asynchronous Transfer Mode) networks, Spread Spectrum
technology, Network Architecture, and a broader group that focuses on
different applications for Wireless Networks.

A  more  comprehensive  listing  of  highly  cited  documents  is  contained  in  Appendix 
13.  In  this  Appendix,  the  first  author’s  Citation-Assisted  Background  (CAB) 
method  [Kostoff  and  Shlesinger,  2005d]  is  used  to  identify  the  seminal  documents 
in  Wireless  LANs,  and  present  these  seminal  documents  in  chronological  order. 


4.1.5.3 Most Cited Journals


TABLE  8  -  MOST  CITED  JOURNALS 
(cited  by  other  papers  in  this  database  only) 


JOURNAL                 TIMES CITED

IEEE T COMMUN           2804
IEEE J SEL AREA COMM    2178
IEEE T VEH TECHNOL      1545
IEEE COMMUN MAG         691
IEEE T INFORM THEORY    500
IEEE PERS COMMUN        499
P IEEE                  490
IEICE T COMMUN          380
ELECTRON LETT           352
DIGITAL COMMUNICATIO    310
IEEE J SOLID-ST CIRC    224
IEEE T SIGNAL PROCES    206
BELL SYST TECH J        194
IEEE ACM T NETWORK      188
IEEE T ANTENN PROPAG    168
WIREL NETW              164
MICROWAVE MOBILE COM    134
IEEE T COMPUT           124
COMPUT NETWORKS ISDN    123
IEEE NETWORK            108


The  list  of  most  cited  journals  is  very  similar  to  the  list  of  most  prolific  journals, 
although  some  items  are  not  in  the  same  order. 


Based on the bibliometric results, those interested in researching Wireless
LANs would be well advised, first, to obtain the highly cited papers listed
and, second, to peruse those sources that are highly cited and/or contain
large numbers of recently published papers.

4.2 Database Tomography Results

There  are  two  major  analytic  methods  used  in  this  section  to  generate  taxonomies 
of  the  SCI  databases:  concept  clustering,  based  on  phrase/  word  aggregation,  and 
document  clustering,  based  on  document  aggregation.  Counting  of  documents 
within  each  major  cluster/  category  provides  some  estimate  of  level  of  effort 
within  the  thematic  area  represented  by  the  cluster. 

4.2.1  Taxonomies 

Two  statistically  based  concept  clustering  methods  were  used  to  develop 
taxonomies:  factor  matrix  clustering  and  multi-link  clustering.  Both  offer  different 
perspectives  on  taxonomy  category  structure  from  the  document  clustering 
approach  described  later. 

In  this  section,  a  synergistic  combination  of  factor  matrix  and  multi-link  clustering 
is  described  that  offers  substantial  improvement  in  the  quality  of  the  resultant 
clusters.  Once  the  appropriate  factor  matrix  has  been  generated,  the  factor  matrix 
can  then  be  used  as  a  filter  to  identify  the  significant  technical  words/  phrases  for 
further analysis. Specifically, the factor matrix can complement a basic
trivial word list (i.e., a list containing words that are trivial in almost
all contexts, such as ‘a’, ‘the’, ‘of’, ‘and’, ‘or’, etc.) to select
context-dependent high technical content words/phrases for input to a
clustering algorithm. The factor matrix pre-filtering will improve the
cohesiveness of clustering by eliminating those words/phrases that are
operationally trivial in the application context [Kostoff, 2003d; Kostoff et
al, 2005c].

4.2.1.1 Statistical Clustering

Appendix  3  describes  the  statistical  clustering  methodologies  in  more  detail. 
Appendix  3A  overviews  the  generic  statistically-based  clustering  approach, 
Appendix  3B  describes  the  factor  matrix  clustering,  and  Appendix  3C  describes 
the  multi-link  clustering. 

4.2.1.1.1 Concept Clustering

4.2.1.1.1.1 Word and Multiple-Phrase Factor Matrices

Correlation matrices of the 735 highest frequency high technical content words
and 704 highest frequency high technical content multiple-word phrases were
generated, and factor analyses were performed using the TechOasis statistical
package. Scree Plot analysis was used to estimate the number of factors
required, and factor matrices consisting of 12 factors resulted. A description
of these factors, and their aggregation into taxonomies, follows. The
words/phrases in parentheses represent typical high factor loading
words/phrases for that factor. The complete word factor matrix is presented in
Appendix 7, and the complete phrase factor matrix is presented in Appendix 8.
In both these appendices, the rows contain the words or phrases extracted from
the free text Abstracts, and the columns contain the factors. Each factor
represents a technical theme of the database, with the first factor being the
most coherent, and the coherency decreasing monotonically to the last factor.
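
The first step above, generating a correlation matrix of terms, can be
sketched as follows. This is a minimal illustration under stated assumptions,
not the procedure used by the statistical package: it assumes each term is
represented by its occurrence counts across the Abstracts, uses the Pearson
correlation, and uses hypothetical names (pearson, correlation_matrix) and
hypothetical data. The subsequent factor extraction and Scree Plot analysis
are not reproduced here.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length count vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant vector carries no correlation information
    return cov / sqrt(vx * vy)


def correlation_matrix(term_counts):
    """Return the sorted term list and the term-by-term correlation matrix."""
    terms = sorted(term_counts)
    matrix = [[pearson(term_counts[a], term_counts[b]) for b in terms]
              for a in terms]
    return terms, matrix


# Hypothetical counts of three terms across five Abstracts.
term_counts = {
    "arq":            [2, 0, 1, 0, 0],
    "retransmission": [1, 0, 1, 0, 0],
    "cmos":           [0, 3, 0, 2, 0],
}
terms, corr = correlation_matrix(term_counts)
print(terms)
print([[round(v, 2) for v in row] for row in corr])
```

Terms that tend to appear in the same Abstracts (here, the two error-control
terms) correlate strongly and would be expected to load on a common factor,
while terms from an unrelated theme correlate negatively or weakly.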

In  the  multi-link  section  following  the  factor  matrices,  taxonomies  will  be 
generated  using  the  multi-link  hierarchical  clustering  approach.  The  twelve  factors 
for  words  and  phrases  will  then  be  assigned  to  the  appropriate  categories  in  the 
taxonomy,  providing  good  coverage  and  an  excellent  match. 

4.2.1.1.1.1.1 Word Factor Matrix

Factor  1  (ARQ,  error,  automatic,  Correction,  retransmission,  request,  bit,  data,  link, 
rate,  transport,  throughput,  errors,  TCP,  forward,  layer)  -  focuses  on  the  use  of  the 
ARQ  (Automatic  Repeat  Request)  scheme  to  compile  erroneous  and  fragmented 
packets.  Multiple  copies  of  the  packets  are  repeatedly  transmitted  until  the  error  is 
corrected. 

Factor 2 (MAC, collision, access, protocol, traffic, contention, medium,
reservation, packet, throughput, priority, slots) - focuses on a number of
different MAC (Medium Access Control) protocols, whose function is to prevent
multi-station signals from colliding on identical channels.

Factor  3  (multimedia,  IP,  technology,  broadband,  research,  technical,  issues, 
future,  telecommunications,  service,  Technologies,  Internet,  services)  -  focuses  on 
broadband  multimedia  transmission  that  can  carry  numerous  channels  at  once, 
allowing  rapid  data  transfer. 

Factor 4 (equalizer, equalization, multipath, interference, blind, multi-user,
Response, feedback, inter-symbol, estimation, receiver, adaptive, impulse) -
focuses on the use of DFE (Decision Feedback Equalization) to combat multipath
ISI (inter-symbol interference), prevalent within indoor wireless data networks.

Factor  5  (hosts,  routing,  network,  multicast,  hoc,  ad,  nodes,  protocols,  distributed, 
ATM,  implementation)  -  focuses  on  mitigating  the  problem  faced  when  mobile 
hosts  attempt  to  communicate  with  one  another  on  an  ad-hoc  network. 

Factor 6 (CMOS, supply, GHz, V, circuit, amplifier, chip, power, RF, MHz,
transceiver, dB, band) - focuses on CMOS (complementary metal oxide
semiconductor) technology. GHz-range radio frequency and baseband circuits
rely heavily on CMOS technology.

Factor  7  (ATM,  Call,  QoS,  handoff,  admission,  blocking,  service,  transfer,  calls, 
traffic,  quality,  connections)  -  focuses  on  ATM  (Asynchronous  Transfer  Mode),  a 
network  technology  based  on  transferring  data  in  cells.  These  types  of  networks 
can  guarantee  a  constant  level  of  QoS  (Quality-of-service)  at  all  times. 

Factor  8  (Rake,  coherent,  pilot,  background,  W-CDMA,  reverse,  laboratory,  SIR, 
DS-CDMA,  transmit,  diversity,  ratio,  antenna,  signal-to-interference,  combining)  - 
focuses  on  W-CDMA,  which  employs  a  fast  cell-search  algorithm  and  rake 
composition  receivers  to  combat  multipath  interference. 

Factor  9  (ad,  hoc,  routing,  nodes,  route,  topology,  mobility,  hosts)  -  focuses  on  an 
adaptive  network  routing  approach  to  assist  in  the  communication  of  multiple 
mobile  hosts  within  an  ad-hoc  network. 

Factor  10  (probability,  model,  distribution,  probabilities,  derived,  parameters, 
derive,  models,  analyze,  analytical,  expressions,  approximation,  outage)  -  focuses 
on  the  statistical  analysis  and  modeling  of  MAC  wireless  protocols  and  their  ability 
to  prevent  same-station  channels  from  colliding. 

Factor  11  (CDMA,  code,  division,  capacity,  mobile,  multiple,  code-division, 
cellular,  multiple-access,  base)  -  focuses  on  different  types  of  CDMA  (Code- 
Division  Multiple  Access)  cellular-mobile  and  wireless  communications. 

Factor  12  (keying,  modulation,  shift,  differential,  orthogonal,  phase,  frequency, 
QPSK,  fading,  quadrature,  codes,  error,  offset,  symbol)  -  focuses  on  D-DQPSK 
(Double  Differential  Quadrature  Phase  Shift  Keying)  being  applied  to  OFDM 
(Orthogonal  Frequency  Division  Multiplexing),  to  cut  down  on  the  disturbance  of 
signal  transmissions  by  splitting  the  signal  into  smaller  sub-signals. 

4.2.1.1.1.1.2 Multiple-Word Phrase Factor Matrix

Factor 1 (MC-CDMA system, frequency diversity, OFDM, MC-CDMA, multipath
fading channels, MAI, frequency selectivity, users, receiver, fading channels,
ISI, spreading codes, intersymbol interference ISI) - focuses on Multi-Carrier
CDMA (MC-CDMA) systems using Orthogonal Frequency Division Multiplexing
(OFDM) to resolve the frequency selectivity in multipath fading channels.

Factor  2  (Uplink  channels,  talkspurt,  integrated  voice,  voice  packets,  medium 
access  control  protocol,  voice  terminals,  time  slots,  data  packets,  collision, 
spreading  codes,  data  services)  -  focuses  on  a  MAC  protocol  for  integrated  voice 
and  data  services  in  which  the  uplink  channels  are  composed  of  time  slots  with 
multiple  spreading  codes  for  each  slot. 

Factor  3  (power  control  TPC,  fast  TPC,  average  BER,  fast,  antenna  diversity 
reception,  link  capacity,  Rake  combining,  coherent  Rake  combining,  pilot  symbols, 
E-b/N-0)  -  focuses  on  BER  (Bit  Error  Rate)  in  a  channel  without  a  fast  TPC 
(Transmit  Power  Control),  such  as  a  common  control  channel  in  a  real  multipath- 
fading  channel. 

Factor  4  (voice  communications,  dispersed  voice,  data  communications,  data 
transmissions,  random  access  protocol,  data  terminals,  voice,  voice  packet,  data 
traffic,  voice  terminals,  PRMA,  voice  traffic,  access)  -  focuses  on  a  TDMA-based 
wireless  network  that  provides  access  to  a  base  station  for  many  spatially  dispersed 
voice  and  data  terminals  in  an  attempt  to  integrate  voice  and  data  communications. 

Factor  5  (TDMA  scheme,  CDMA  scheme,  same  frequency  band,  impulse 
response,  mobile  terminal,  bit  error  rate  performance,  downlink,  DFE,  uplink, 
received  signals,  bit  error  rates)  -  focuses  on  a  TDMA/CDMA  network  that 
connects  mobile  terminals  to  a  LAN  through  a  radio  central  unit  and  provides  both 
uplink  and  downlink  unbalanced  data  rates  in  the  same  frequency  band. 

Factor  6  (bandwidth  allocation,  QoS  support,  real-time  applications,  mobile 
environment,  QoS,  wired  networks,  simulation  experiments,  admission  control, 
new  calls,  handoff  calls,  source,  call  admission  control,  on-demand)  -  focuses  on  a 
wireless  ad-hoc  network  that  is  able  to  ensure  QoS  (Quality  of  Service)  support  at 
all  times  within  a  mobile  environment  through  the  use  of  bandwidth  allocation. 

Factor 7 (ad hoc networks, nodes, node, mobile nodes, multicast tree, ad hoc
network, on-demand, topology, protocols, network, mobile hosts, multicast) -
focuses on ad hoc networks that are made up of a number of mobile hosts
without a centralized system, and on the WMTP (Wireless Multicast Tree
Problem) that occurs when two mobile nodes attempt to communicate with one
another.

Factor  8  (station,  throughput,  channel,  stations,  wireless  channel,  ALOHA, 
Rayleigh  fading,  hidden  terminals,  backoff  algorithm,  capture,  packet  length,  IEEE 
802.11  protocol,  capture  effect,  performance  measures,  protocol)  -  focuses  on 
improving  throughput  and  message  delay  delivery  in  a  conventional  slotted 
ALOHA  protocol  with  the  use  of  receiver  capture.  This  also  lessens  path  loss  and 
channel  fading. 

Factor  9  (TCP,  energy  efficiency,  wireless  links,  wireless  link,  transport,  ARQ, 
packet  size,  low  power,  Service  QoS  requirements,  energy,  throughput,  UDP)  - 
focuses  on  improving  TCP’s  (Transmission  Control  Protocol)  energy  efficiency  in 
transporting  data  over  wireless  links.  An  ARQ  (Automatic  Repeat  Request) 
mechanism  will  support  TCP  traffic  over  wireless  networks. 

Factor  10  (mobility  support,  wireless  ATM,  mobile  ATM,  location  management, 
mobility,  prototype  implementation,  handoff,  mobile  users,  multimedia,  PCS, 
ATM,  voice  services)  -  focuses  on  improving  the  mobility  of  ATM  networks 
through  the  use  of  handoff  and  location  management  schemes. 

Factor 11 (power control TPC, voice communications, data communications,
dispersed voice, fast TPC, data transmissions, antenna diversity reception,
average BER, coherent Rake combining, link capacity, Rake combining, fast) -
focuses on the use of fast TPC (Transmit Power Control) and coherent Rake
combining within a W-CDMA (Wideband Code Division Multiple Access) protocol to
increase radio link capacity in data- and voice-based communications.

Factor 12 (handoff calls, new calls, cells, cell, base stations, neighboring
cells, shadowing, reverse link, users, base station, system capacity) -
focuses on a handoff management scheme for wireless ATM networks in which all
cells are connected to neighboring cells by permanent virtual circuits (PVCs).
Using PVCs for handoff calls guarantees fast and seamless handoffs without
necessarily relying on a base station.

This  completes  the  specification  of  the  word  and  phrase  factor  matrices. 

Now that the twelve-factor matrices have been generated, they can be used for
word/phrase filtering and selection for myriad applications. The desired
application at this point is filtering words/phrases for input to word/phrase
limited multi-link clustering algorithms. In the present study, the 735 words
and 704 phrases in the factor matrices have to be culled to the approximately
250 allowed by the Excel-based clustering package, WINSTAT. The 250 word limit
is an artifact of Excel. Other software packages may allow more or fewer words
to be used for clustering, but all approaches perform culling to reduce
dimensionality. The filtering process presented here is applicable to any
level of filtered words desired.

The factor loadings in the factor matrix were converted to absolute values.
Then, a simple algorithm was used to automatically extract those high factor
loading words at the tail of each factor. If word variants were on this list
(e.g., singulars and plurals), and their factor loadings were reasonably close
(Kostoff, 2003b), they were conflated (e.g., ‘agent’ and ‘agents’ were
conflated into ‘agents’, and their frequencies were added). A few words were
eliminated manually, based on factor loading and estimate of technical content.
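
The filtering and conflation steps above can be sketched as follows. The
extraction algorithm itself is not specified in the text, so this sketch
substitutes a simple assumption: a word is kept if its absolute loading on
any factor meets a threshold. The threshold (0.4), the closeness tolerance
(0.1), the function names, and the sample loadings are all hypothetical.

```python
def select_high_loading_terms(loadings, threshold=0.4):
    """Keep terms whose peak absolute factor loading meets the threshold."""
    selected = {}
    for term, row in loadings.items():
        peak = max(abs(value) for value in row)
        if peak >= threshold:
            selected[term] = peak
    return selected


def conflate_plurals(selected, frequencies, tolerance=0.1):
    """Merge a singular into its plural when their loadings are close."""
    merged = {term: frequencies[term] for term in selected}
    for term in list(selected):
        plural = term + "s"
        close = (plural in selected
                 and abs(selected[term] - selected[plural]) <= tolerance)
        if close and term in merged and plural in merged:
            # Add the singular's frequency to the plural and drop the singular.
            merged[plural] += merged.pop(term)
    return merged


# Hypothetical loadings on two factors, and hypothetical word frequencies.
loadings = {
    "agent":   [0.05, -0.62],
    "agents":  [0.02, -0.58],
    "network": [0.45, 0.10],
    "paper":   [0.03, 0.08],
}
frequencies = {"agent": 40, "agents": 65, "network": 300, "paper": 500}

selected = select_high_loading_terms(loadings)
merged = conflate_plurals(selected, frequencies)
print(merged)
```

In this sketch the high-frequency but low-loading word ‘paper’ is dropped,
which illustrates how the factor matrix acts as a context-dependent trivial
word filter.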

4.2.1.1.1.2 Word and Phrase Multi-Link Clustering

A symmetrical co-occurrence matrix of the 253 highest frequency high technical
content words was generated. The matrix elements were normalized using the
Equivalence Index (E_ij = C_ij^2 / (C_i * C_j), where C_ij is the
co-occurrence frequency of the ith and jth phrases, C_i is the total
occurrence frequency of the ith phrase, and C_j is the total occurrence
frequency of the jth phrase, for the matrix element ij), and a multi-link
clustering analysis was performed using the WINSTAT statistical package. The
Average Linkage hierarchical aggregation method was used. A description of the
final 253 word and phrase dendrograms (a dendrogram is a hierarchical
tree-like structure), and the aggregation of their branches into a taxonomy of
categories, follows. Appendix 9 is the dendrogram of the 253 words and
Appendix 10 is the dendrogram of the 253 phrases. One axis is the words, and
the other axis (‘distance’) reflects their similarity. The lower the value of
‘distance’ at which words, or word groups, are linked together, the closer
their relation. As an extreme case of illustration, words that tend to appear
as members of multi-word phrases, such as ‘Inter-Symbol Interference’, ‘Medium
Access Control’, or ‘multicast tree’, appear adjacent on the dendrogram with
very low values of ‘distance’ at their juncture. The capitalized phrases in
parentheses represent cluster boundary phrases for each category.
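
The normalization and aggregation steps above can be sketched on a small
example. This is an illustrative sketch, not the WINSTAT implementation: the
function names and sample counts are hypothetical, and the merge ‘distance’ is
assumed here to be one minus the average Equivalence Index between two
clusters.

```python
def equivalence_index(cooc, totals):
    """E[i][j] = C_ij^2 / (C_i * C_j) for a symmetric co-occurrence matrix."""
    n = len(totals)
    return [[cooc[i][j] ** 2 / (totals[i] * totals[j]) for j in range(n)]
            for i in range(n)]


def average_linkage(similarity, labels):
    """Merge clusters by highest average similarity; return merge history."""
    clusters = [[i] for i in range(len(labels))]
    merges = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                pairs = [(i, j) for i in clusters[a] for j in clusters[b]]
                avg = sum(similarity[i][j] for i, j in pairs) / len(pairs)
                if best is None or avg > best[0]:
                    best = (avg, a, b)
        avg, a, b = best
        # Record the two merged groups and the assumed distance (1 - avg).
        merges.append(([labels[i] for i in clusters[a]],
                       [labels[i] for i in clusters[b]],
                       1 - avg))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return merges


# Hypothetical word totals and co-occurrence counts.
labels = ["inter-symbol", "interference", "multicast", "tree"]
totals = [20, 50, 30, 30]
cooc = [
    [20, 18, 0, 1],
    [18, 50, 2, 1],
    [0, 2, 30, 24],
    [1, 1, 24, 30],
]
E = equivalence_index(cooc, totals)
merges = average_linkage(E, labels)
for left, right, distance in merges:
    print(left, right, round(distance, 3))
```

Words that mostly occur together, such as the members of a multi-word phrase,
are joined first at a low distance, and unrelated groups are joined last at a
distance close to one, mirroring the dendrogram behavior described above.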

Experience with both word and phrase clustering from past text mining studies has
shown  the  following.  Word  clustering  is  most  useful  in  determining  the  scope  and 
thematic  thrusts  of  the  higher  hierarchical  categories,  because  of  the  greater 
breadth  of  single  words.  Phrase  clustering  is  most  useful  in  determining  the 
thematic  thrusts  of  individual  lower  hierarchical  level  clusters,  because  of  the  more 
specific  nature  of  phrases.  Therefore,  only  the  single  words  will  be  used  to 
determine  the  themes  of  the  higher  and  broader  categorical  levels,  but  both  word 
and  phrase  clustering  will  be  used  to  determine  the  themes  of  the  lower  and  more 
focused  categorical  levels. 

4.2.1.1.1.2.1 Word Multi-Link Clustering

The  253  words  in  the  dendrogram  are  grouped  into  21  elemental  clusters.  These 
clusters  form  the  lowest  level  of  the  taxonomy  hierarchy.  Each  cluster  is  assigned 
a  letter,  ranging  from  A  to  U.  The  cluster  hierarchies  are  determined  by  the  branch 
structure  of  the  dendrogram.  At  the  highest  hierarchical  level,  there  are  two  main 
branches (clusters). Starting from the word adjoining the ‘distance’ ordinate, the
first  main  cluster  (A-E)  ranges  from  WIRELESS  to  RECEIVED.  The  second  main 
cluster  (F-U)  ranges  from  COMMUNICATIONS  to  2.4,  and  is  moderately  larger 
in  extent  than  the  first  main  cluster.  The  total  dendrogram  reflects  different  aspects 
of  increasing  Wireless  LANs’  throughput.  The  first  cluster  (A-E)  addresses 
protocols  (CDMA)  to  control  channel  packet  traffic  and  increase  throughput  and 
user  capacity,  as  well  as  efforts  at  the  physical  layer  to  reduce  interference 
limitations on throughput of WLANs. The second cluster (F-U) covers Network
Architecture and Applications designed to improve throughput and reduce latency
effects from multi-path signal propagation, including infrastructure, protocols,
hardware, and applications. Each of these highest level clusters will be divided
and  sub-divided  into  smaller  clusters,  and  discussed. 

Cluster  (A-E)  can  be  divided  into  clusters  (A-B)  and  (C-E).  Cluster  (A-B)  ranges 
from  WIRELESS  to  ANALYTICAL,  and  cluster  (C-E)  ranges  from  POWER  to 
RECEIVED.  Cluster  (A-B)  focuses  on  the  simulation  of  data  transmissions, 
especially  using  CDMA  and  other  packet  traffic  schemes  affecting  throughput, 
delay,  and  user  accessibility.  Cluster  (C-E)  focuses  on  interference  issues  of  the 
physical  layer  and  on  ways  of  mitigating  them  (Rake  combining  and  antennas/ 
receivers). 

Cluster  (F-U)  can  be  divided  into  clusters  (F-P)  and  (Q-U),  where  cluster  (F-P)  is 
much  larger  than  cluster  (Q-U).  Cluster  (F-P)  ranges  from  COMMUNICATIONS 
to DETECTOR, and cluster (Q-U) ranges from JOINT to 2.4. Cluster (F-P) focuses
on network hardware and architectures and methods for increasing throughput
(such  as  ATM  for  dynamic  bandwidth  allocation,  or  MAC  and  other  transport 
protocols).  In  addition,  some  applications  of  the  encompassed  technologies  are 
included.  Cluster  (Q-U)  focuses  on  signal  propagation,  and  especially  the  use  of 
QPSK  and  OFDM  within  CDMA  to  make  the  system  more  robust  to  effects  such 
as  multipath-induced  intersymbol  interference. 

(The  following  is  a  listing  of  the  lowest  level  word  clusters,  with  a  brief 
summarization  of  each  cluster  following  the  range) 

Cluster  A  (Wireless  to  Radio)  focuses  on  the  transmission  of  data  within  a  standard 
wireless  network  and  on  several  different  protocols  that  format  this  data. 

Cluster B (Access to Analytical) focuses on CDMA (code-division multiple
access) slotted ALOHA schemes aimed at improving throughput and message
delivery delay when traffic loads are sensed within a channel.

Cluster  C  (Power  to  Multipath)  focuses  on  the  use  of  power  control  to  improve  the 
BER  (Bit-error  rate)  within  multipath  Rayleigh  fading  channels. 

Cluster  D  (Diversity  to  Signal-to-interference)  focuses  on  DS-CDMA  schemes  that 
utilize  coherent  Rake  combining  and  SIR  (Signal-to-interference  power  ratio) 
measurement-based  fast  TPC  (Transmit  Power  Control). 

Cluster  E  (Noise  to  Received)  focuses  on  developing  algorithms  for  adaptive 
antenna  arrays  with  the  use  of  Gaussian  noise  for  testing  purposes. 

Cluster  F  (Communications  to  Range)  focuses  on  digital,  personal  communications 
at  a  global  level. 

Cluster  G  (Supporting  to  Handoff)  focuses  on  management  tools  that  can  support 
QoS  (Quality  of  service)  within  a  wide  range  of  multimedia  wireless  internet 
applications. 

Cluster  H  (ATM  to  Guarantee)  focuses  on  the  use  of  ATM  (Asynchronous 
Transfer  Mode)  and  bandwidth  allocation  to  increase  data  throughput. 

Cluster I (Voice to Layer) focuses on Hop Reservation MAC (Media Access
Control) protocols developed for packet-radio networks to ensure collision-free
data transmissions.

Cluster J (Errors to Distributed) focuses on TCP (Transmission Control Protocol),
a  reliable  transport  protocol  with  congestion  control  that  diminishes  the  number  of 
erroneous  packets  transmitted. 

Cluster  K  (Nodes  to  Scheduling)  focuses  on  ad-hoc  routing  protocols  where  nodes 
are  only  required  to  keep  track  of  their  own  position  and  hop-stations. 

Cluster  L  (Standards  to  Probability)  focuses  on  the  research  and  development  of 
third-generation  Telecommunication  system  platforms. 

Cluster  M  (Outage  to  Local)  focuses  on  the  output  and  delay  process  of  voice/data 
CDMA  (Code-Division  Multiple  Access)  that  gives  priority  to  local  voice  calls  by 
blocking  out  data  packets. 

Cluster N (Area to Integrated) focuses on the infrastructure of standard IEEE
802.11 wireless local area networks.

Cluster  O  (Circuits  to  Components)  focuses  on  CMOS  (Complementary  Metal 
Oxide  Semiconductor)  technology  in  circuitry  and  chips. 

Cluster  P  (Filters  to  Detector)  focuses  on  collision  resolution  through  slotted 
ALOHA  protocol. 

Cluster  Q  (Joint  to  Propagation)  focuses  on  DS-CDMA  (Direct-Sequence  Code- 
Division  Multiple  Access)  equipped  with  adaptive  antenna  arrays  offering  jointly 
effective  spatial  and  temporal  MAI  (Multiple  Access  Interference)  and  channel 
noise  suppression. 

Cluster  R  (Indoor  to  Modulation)  focuses  on  the  development  of  techniques  to 
lessen  ISI  (Intersymbol  Interference)  within  indoor  wireless  networks. 

Cluster  S  (QPSK  to  Experiments)  focuses  on  QPSK  (Quadrature  Phase  Shift 
Keying)  being  applied  to  OFDM  (Orthogonal  Frequency  Division  Multiplexing)  to 
improve  BER. 

Cluster T (Wideband to Forward) focuses on field and laboratory
experiments on W-CDMA (Wide-band Code-Division Multiple Access) to improve
BER (Bit Error Rate) by applying STTD (Space Time Transmit Diversity).

Cluster U (Correction to 2.4) focuses on ARQ (Automatic-Repeat-Request)
schemes  that  improve  throughput  for  high-speed  packets  through  transmission 
repetition. 

4.2.1.1.1.2.2 Phrase Multi-Link Clustering

The 253 phrases in the dendrogram are grouped into 22 elemental clusters. These
clusters  form  the  lowest  level  of  the  taxonomy  hierarchy.  Each  cluster  is  assigned 
a  letter,  ranging  from  A  to  V.  These  lowest  level  phrase  clusters  may  be 
interpreted  as  follows. 

Cluster  A  (SYSTEMS  to  ISI)  focuses  on  limiting  the  amount  of  ISI  (Inter-Symbol 
Interference)  within  wireless  networks. 

Cluster  B  (CDMA  SYSTEMS  to  BIT  ERROR  RATES)  focuses  on  both  CDMA 
(Code-Division  Multiple  Access)  and  TDMA  (Time-Division  Multiple  Access) 
schemes  for  minimizing  the  multiple  access  problem. 

Cluster  C  (MULTIPATH  to  FREQUENCY  SELECTIVITY)  focuses  on  CDMA’s 
(Code-Division  Multiple  Access)  and  OFDM’s  (Orthogonal  Frequency  Division 
Multiplexing)  performance  and  capacity  being  limited  by  MAI  (Multiple  Access 
Interference). 

Cluster  D  (RAYLEIGH  to  MULTIUSER  DETECTION)  focuses  on  the  simulated 
degradation  of  performance  of  DS  CDMA  (Direct  Sequence  Code  Division 
Multiple  Access)  systems  when  exposed  to  flat  Rayleigh  fading  channels. 

Cluster  E  (THROUGHPUT  to  ALOHA  PROTOCOL)  focuses  on  the  throughput 
performance  of  a  direct  sequence  spread  spectrum  (DS/SS)  slotted  ALOHA 
wireless  communication  network  system  with  delay  capture. 

Cluster  F  (CHANNEL  to  VOICE  ACTIVITY)  focuses  on  the  performance  of 
wireless  MAC  (Media  Access  Control)  protocols  with  Rayleigh  fading,  shadowing, 
and  capture  effect. 

Cluster  G  (STATIONS  to  ANALYTICAL  MODEL)  focuses  on  the  use  of 
analytical  models  to  study  standard  IEEE  802.11  protocols  with  a  dynamically 
tuned backoff.

Cluster H (NUMERICAL RESULTS to WAITING TIME) focuses on the use of
CDMA  (Code-Division  Multiple  Access)  and  CSMA  (Carrier  Sense  Multiple 
Access)  protocols  to  improve  signal  throughput. 

Cluster  I  (NETWORK  NODES  to  MAXIMUM  DOPPLER  FREQUENCY) 
focuses  on  utilizing  innovative  Network  Nodes  to  allow  robust  electronic 
commerce  and  real-time  image/video  transmission. 

Cluster  J  (WIRELESS  LINKS  to  TCP  PERFORMANCE)  focuses  on  TCP 
(Transmission  Control  Protocol)  and  UDP  (User  Datagram  Protocol)  as  a  means  of 
sending  and  receiving  signals  over  an  IP  (Internet  Protocol)  network. 

Cluster  K  (BER  PERFORMANCE  to  PACKET  SIZE)  focuses  on  the  throughput 
and  energy-efficiency  of  TCP  (Transmission  Control  Protocol)  on  wireless  links 
using  ARQ  (Automatic  repeat  Request)  and  FEC  (Forward  Error  Correction) 
protocols. 

Cluster  L  (POWER  CONSUMPTION  to  BATTERY)  focuses  on  low  power 
consumption  as  being  a  key  design  metric  for  portable  wireless  network  devices 
where  battery  energy  is  a  limited  resource. 

Cluster M (PROTOCOLS to BROADCAST) focuses on a medium access control
protocol  for  a  packet  transmission  mobile  radio  network. 

Cluster  N  (NODES  to  CONNECTIVITY)  focuses  on  a  framework  for  dynamically 
organizing  mobile  nodes  in  wireless  ad  hoc  networks  into  clusters  in  which  the 
probability  of  path  availability  can  be  bounded. 

Cluster  O  (OVERHEAD  to  MECHANISM)  focuses  on  the  overhead  and  multicast 
tree  problems  within  standard  MANET  (Mobile  Ad  Hoc  Networks). 

Cluster P (SIMULATION RESULTS to ADMISSION CONTROL) focuses on
several simulated schemes, based on resource reservation, that investigate the
performance of a CDMA cellular system, a network supporting integrated data
transfer and providing predictable QoS (Quality of Service).

Cluster  Q  (CELLS  to  COMMUNICATION)  focuses  on  future  personal 
communications  networks  supporting  network-wide  handoffs,  forced  call 
terminations due to handoff call blocking, and the previously proposed guard
channel scheme for radio channel allocation in cellular networks reducing handoff
call  blocking  probability. 

Cluster R (ATM to MOBILE USER) focuses on the use of ATM (Asynchronous
Transfer Mode) for the seamless integration of advanced multimedia services over
both fixed and mobile networks leading to fixed-mobile convergence.

Cluster  S  (IP  to  MOBILE  COMMUNICATIONS)  focuses  on  the  concept  of 
mobile  ATM  (Asynchronous  Transfer  Mode),  a  proposal  for  third-generation 
mobile  communication  network  infrastructure  capable  of  supporting  flexible 
evolution  of  radio  technologies  from  today's  cellular  and  data  services  towards 
future  wireless  multimedia  services. 

Cluster  T  (WIRELESS  ACCESS  to  DATA)  focuses  on  the  use  of  broadband 
technology  to  increase  the  data  rate  for  multimedia,  messaging,  and  high-speed 
Internet  access. 

Cluster  U  (VOICE  to  EXPERIMENTAL  RESULTS)  focuses  on  the  performance 
of  DPRMA  (Dynamic  Packet  Reservation  Multiple  Access)  schemes  and  their 
ability  to  perform  well  in  a  system  with  voice,  video  conferencing,  and  data  users 
present. 

Cluster V (AVERAGE BER to ACCESS) focuses on an outer loop control method
of fast TPC (Transmit Power Control) for high-quality data transmission, such as
that with the average bit error rate (BER) of 10^-6, in serial concatenated channel
coding combining convolutional and Reed-Solomon coding for DS-CDMA
mobile radio.

At  this  point,  the  relationship  between  the  lowest  level  factors  and  clusters,  and 
their  aggregation  into  higher  level  category  clusters,  will  be  examined.  Table  9 
shows  the  assignment  of  factors  and  individual  clusters  to  the  second  level 
categories  of  the  multi-link  word  clustering-defined  taxonomy.  The  first  row 
represents  the  highest  taxonomy  level,  and  the  second  row  represents  the  next 
highest  taxonomy  level.  Each  taxonomy  category  in  the  second  row  includes  the 
column  in  which  it  is  written,  plus  the  adjacent  column  to  the  right. 

As an example, under the first level category (Improved Throughput Protocols/
Physical Layer Limitations), there are two second level categories: Improved
Throughput Protocols, and Physical Layer Throughput Limitations. Under the
second level category of Improved Throughput Protocols are word clusters (A, B),
word factors (10, 11), and phrase factors (5). There is good correspondence
between the factors and individual clusters within the higher level categories.


TABLE 9 - FACTOR MATRIX - WORD/PHRASE CLUSTER TAXONOMY


CLUSTERS (A-E)                      CLUSTERS (F-U)
IMPROVED THROUGHPUT PROTOCOLS/      IMPROVED THROUGHPUT
PHYSICAL LAYER LIMITATIONS          ARCHITECTURES

CLUSTERS (A-B)    CLUSTERS (C-E)    CLUSTERS (F-P)    CLUSTERS (Q-U)
IMPROVED          PHYSICAL LAYER    IMPROVED          SIGNAL
THROUGHPUT        THROUGHPUT        THROUGHPUT        PROPAGATION
PROTOCOLS         LIMITATIONS       ARCHITECTURES     PROTOCOLS TO
                                                      REDUCE LATENCY

FACT              FACT              FACT              FACT
WRD  PHR          WRD  PHR          WRD  PHR          WRD  PHR

10  5 

8  8 

2  2 

1  1 

11 

3  3 

4 

5  4 

12 

CD 

CD 

7  7 

9  9 

10 

11 

12 

4.2.1.1.2 Document Clustering

Document  clustering  is  the  grouping  of  similar  documents  into  thematic  categories. 
Different  approaches  exist  (e.g.,  Willett,  1988;  Rasmussen,  1992;  Cutting  et  al, 
1992;  Guha  et  al,  1998;  Hearst,  1998;  Zamir  and  Etzioni,  1998;  Karypis  et  al, 
1999;  Steinbach  et  al,  2000).  Two  approaches  were  examined  in  this  paper: 
Greedy  String  Tiling,  and  Partitional  Clustering. 

4.2.1.1.2.1 Greedy String Tiling

4.2.1.1.2.1.1 Greedy String Tiling Approach

The approach presented in this section is based on a Greedy String Tiling (GST)
text matching algorithm (Wise, 1992; Prechelt et al, 2002). It is described in some
detail in Appendix 1. Basically, GST clustering forms groups of documents based
on  the  cumulative  sum  of  shared  strings  of  words.  Each  group  is  termed  a  cluster, 
and  the  number  of  records  in  each  cluster,  and  the  highest  frequency  technical 
keywords  in  each  cluster,  are  two  outputs  central  to  this  analysis.  The  shared 
strings  for  any  pair  of  documents  form  the  basis  for  a  similarity  metric  for  the  pair. 
Document  membership  within  a  cluster  is  determined  by  the  similarity  threshold 
selected. 

4.2.1.1.2.1.2 Greedy String Tiling Results

The  similarity  metric  threshold  value  was  varied  from  one  to  twenty  percent, 
within  a  complete  link  clustering  algorithm.  Based  on  a  balance  between  cluster 
size  and  cohesiveness,  a  four  percent  similarity  threshold  was  selected.  The  four 
percent  similarity  threshold  produced  a  total  of  167  clusters.  The  60  largest 
clusters,  containing  1521  Abstracts,  were  extracted,  and  are  listed  by  size  in 
Appendix  11,  in  descending  numerical  order.  The  main  keywords  from  each 
cluster  are  shown  in  parentheses  after  the  cluster  number,  and  the  number  of 
records in each cluster is shown in parentheses before the cluster number. The
keywords are arranged in frequency of appearance, in descending order. Three
levels of filtering were used to obtain the main keywords shown. First, a trivial
word list (e.g., of, the, on, etc.) was applied to the raw data. Second, only the
highest frequency words for each cluster were retained. Third, a manual filtering
was performed on the thirty highest frequency words to eliminate low technical content
words.  The  themes  of  each  cluster  follow  each  grouping  of  words.  The  taxonomy 
based  on  these  themes  is  listed  in  the  section  titled  Word  Clustering  Taxonomy, 
located  at  the  end  of  the  document  clustering  section. 
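
The three filtering levels can be illustrated with a short Python sketch. The word lists, sample abstracts, and function name below are hypothetical, and the third filter (a manual judgment in the study) is represented here by a fixed set of low technical content words.

```python
from collections import Counter

def main_keywords(cluster_texts, trivial_words, low_content_words, top_n=30):
    """Return a cluster's keywords in descending frequency after three filters."""
    counts = Counter(
        word
        for text in cluster_texts
        for word in text.lower().split()
        if word not in trivial_words           # filter 1: trivial word list
    )
    top = counts.most_common(top_n)            # filter 2: highest frequency words only
    # Filter 3: stands in for the manual removal of low technical content words.
    return [word for word, _ in top if word not in low_content_words]

abstracts = [
    "the throughput of the slotted aloha protocol",
    "throughput analysis of aloha on the paper channel",
    "throughput of the fading channel",
]
keywords = main_keywords(
    abstracts,
    trivial_words={"of", "the", "on"},
    low_content_words={"paper", "analysis"},
    top_n=6,
)
print(keywords)
```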

4.2.1.1.2.2 Partitional Clustering

4.2.1.1.2.2.1 Partitional Clustering Approach

The  approach  presented  in  this  section  is  based  on  a  partitional  clustering 
algorithm  (Zhao  and  Karypis,  2004)  contained  within  a  software  package  named 
CLUTO.  Most  of  CLUTO’s  clustering  algorithms  treat  the  clustering  problem  as 
an  optimization  process  that  seeks  to  maximize  or  minimize  a  particular  clustering 
criterion  function  defined  either  globally  or  locally  over  the  entire  clustering 
solution  space.  CLUTO  uses  a  randomized  incremental  optimization  algorithm 
that is greedy in nature, and has low computational requirements. Twenty-one
individual clusters were chosen to correlate with the twenty-one manually
determined clusters from earlier in this paper. The twenty-one clusters are listed in
Appendix 12. Each cluster is numbered (beginning with zero), and the number of
documents in each cluster appears in parentheses at the beginning of the cluster entry.
Also in parentheses are the most descriptive words in each cluster. Each word within
the  cluster  is  followed  by  a  number  that  represents  the  percentage  of  intra-cluster 
similarity  explained  by  the  word. 
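
One way to read these percentages: if documents are represented as unit-length term vectors and compared by cosine similarity, the summed pairwise similarity inside a cluster decomposes term by term, so each word's share can be computed directly. The Python sketch below makes exactly that assumption; the term weights are hypothetical, and it is an illustration of the idea rather than CLUTO's own code.

```python
import math

def similarity_shares(doc_vectors):
    """Percentage of summed intra-cluster cosine similarity contributed by each term.

    doc_vectors: list of {term: weight} dicts, one per document in the cluster.
    """
    composite = {}
    for vec in doc_vectors:
        norm = math.sqrt(sum(w * w for w in vec.values()))
        for term, w in vec.items():
            # Add the unit-normalized document into the cluster's composite vector.
            composite[term] = composite.get(term, 0.0) + w / norm
    # The sum of cosine similarities over all document pairs (self-pairs included)
    # equals the squared length of the composite vector, i.e. a sum over terms.
    total = sum(c * c for c in composite.values())
    return {term: 100.0 * c * c / total for term, c in composite.items()}

cluster = [
    {"cdma": 3.0, "fading": 4.0},
    {"cdma": 1.0},
    {"cdma": 4.0, "antenna": 3.0},
]
shares = similarity_shares(cluster)
for term, pct in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{term}: {pct:.1f}%")
```

A word shared by every document in the cluster dominates the total, which is why such words are the most descriptive of the cluster.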

4.4.2.2.3.4 Word Clustering Taxonomy


The  taxonomy  defined  by  the  word  clustering  algorithm  is  used  to  categorize  the 
document  clusters.  Each  document  cluster  was  assigned  to  the  most  appropriate 
category  in  the  taxonomy  defined  by  the  WINSTAT-generated  dendrogram  of  the 
last  section,  based  on  the  theme  suggested  by  the  highest  frequency  technical 
keywords.  The  number  of  records  in  each  taxonomy  category  from  all  the  clusters 
in  the  category  was  calculated,  and  is  shown  in  Table  10  for  the  Greedy  String 
Tiling Results. The same taxonomy assignment was done for the CLUTO clustering
results in Table 11. In these Tables, the top two levels of the taxonomy are presented.
The top hierarchical level is composed of Improved Throughput Protocols/
Physical Layer Limitations and Improved Throughput Architectures, and the
second  hierarchical  level  is  composed  of  Improved  Throughput  Protocols,  Physical 
Layer  Throughput  Limitations,  Improved  Throughput  Architectures,  and  Signal 
Propagation  Protocols  To  Reduce  Latency.  The  first  column  is  the  cluster  number, 
and  the  matrix  elements  are  the  number  of  records  in  the  cluster  in  the  specific 
second-level  taxonomy  category.  The  numbers  in  each  second  level  category  are 
summed,  and  are  summed  in  turn  to  give  the  total  number  of  documents  in  each  of 
the  two  first  level  categories. 
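
The roll-up described above amounts to two successive summations. The Python sketch below uses invented cluster numbers and record counts purely to show the arithmetic; none of the figures are taken from Table 10 or Table 11.

```python
# Hypothetical assignment of document clusters to second-level categories:
# cluster number -> (second-level category, number of records).
assignments = {
    0: ("A-B", 40),
    1: ("F-P", 120),
    2: ("C-E", 30),
    3: ("Q-U", 25),
    4: ("F-P", 35),
}
# Second-level category -> first-level category, following the word-cluster taxonomy.
parent = {"A-B": "A-E", "C-E": "A-E", "F-P": "F-U", "Q-U": "F-U"}

# Sum the records in each second-level category.
second_level = {}
for category, records in assignments.values():
    second_level[category] = second_level.get(category, 0) + records

# Sum the second-level totals into the two first-level categories.
first_level = {}
for category, records in second_level.items():
    first_level[parent[category]] = first_level.get(parent[category], 0) + records

total = sum(first_level.values())
for category, records in first_level.items():
    print(category, records, f"{100 * records / total:.0f}% of all records")
```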

4.4.2.2.3.4.1  GREEDY  STRING  TILING  TAXONOMY 


Table  10 


Cluster #   CLUSTERS (A-E)                      CLUSTERS (F-U)
            IMPROVED THROUGHPUT PROTOCOLS/      IMPROVED THROUGHPUT
            PHYSICAL LAYER LIMITATIONS          ARCHITECTURES

            CLUSTERS (A-B)    CLUSTERS (C-E)    CLUSTERS (F-P)    CLUSTERS (Q-U)
            IMPROVED          PHYSICAL LAYER    IMPROVED          SIGNAL
            THROUGHPUT        THROUGHPUT        THROUGHPUT        PROPAGATION
            PROTOCOLS         LIMITATIONS       ARCHITECTURES     PROTOCOLS TO
                                                                  REDUCE LATENCY

Approximately 65% of the records in the 60 GST clusters fall into the higher level
category  (F-U),  Improved  Throughput  Architectures.  The  other  35%  fall  into 
category  (A-E),  Improved  Throughput  Protocols/  Physical  Layer  Limitations. 
Category  (A-E)  breaks  into  two  second  level  groups,  46%  into  clusters  (C-E), 
Physical  Layer  Throughput  Limitations,  and  54%  into  clusters  (A-B),  Improved 
Throughput  Protocols.  Category  (F-U)  also  divides  into  two  second  level  groups, 
79%  into  clusters  (F-P),  Improved  Throughput  Architectures,  and  21%  into 
clusters  (Q-U),  Signal  Propagation  Protocols  to  Reduce  Latency. 


4.4.2.2.3.4.2 CLUTO CLUSTERING TAXONOMY


Table 11

Cluster#  CLUSTERS  CLUSTERS 

(A-E)  (F-U) 

IMPROVED  THROUGHPUT  PROTOCOLS/  IMPROVED  THROUGHPUT 

PHYSICAL  LAYER  LIMITATIONS  ARCHITECTURES 


CLUSTERS  CLUSTERS  CLUSTERS  CLUSTERS 

(A-B)  (C-E)  (F-P)  (Q-U) 

IMPROVED  PHYSICAL  LAYER  IMPROVED  SIGNAL 

THROUGHPUT  THROUGHPUT  THROUGHPUT  PROPAGATION 

PROTOCOLS  LIMITATIONS  ARCHITECTURES  PROTOCOLS  TO 

REDUCE  LATENCY 

0  26

1  12

2  18

3  19

Approximately 63% of the 60 CLUTO clusters fall into the higher level category
(F-U), Improved Throughput Architectures. The other 37% fall into category
(A-E), Improved Throughput Protocols/ Physical Layer Limitations. Category (A-E)
breaks into two second level groups, 49% into clusters (C-E), Physical Layer
Throughput Limitations, and 51% into clusters (A-B), Improved Throughput
Protocols. Category (F-U) also divides into two second level groups, 76% into
clusters (F-P), Improved Throughput Architectures, and 24% into clusters (Q-U),
Signal Propagation Protocols to Reduce Latency.

In  the  previous  taxonomy  sections,  the  higher  level  categories  were  defined  by  the 
results  of  the  word  clustering  algorithm.  Clusters  generated  by  the  two  document 
clustering  algorithms  were  then  assigned  to  these  higher  level  categories  for 
comparison  of  level  of  effort  determination. 

In  the  following,  the  higher  level  categories  are  defined  by  the  results  of  the 
partitioning  algorithms,  and  the  themes  within  these  categories  are  determined  by 
their  elemental  clusters.  The  slightly  modified  perspective  of  the  Wireless  LANs 
database  is  now  summarized. 

The  major  total  database  theme  is  research  on  methods  to  improve  throughput 
while  maintaining  Quality-of-Service.  In  all  three  clustering  approaches  used  in 
this  study,  the  top-level  categorization  appeared  to  consist  of  two  moderately 
distinct  categories.  One  (Improved  Throughput  from  Physical  Layer  Advances) 
addresses  digital  spread  spectrum  modulation  techniques  (CDMA)  to  increase 
throughput  and  user  capacity,  and  the  associated  efforts  at  the  physical  layer  to 
reduce  inherent  interference  limitations  of  CDMA  on  system  capacity.  The  other 
(Improved  Throughput  from  Protocols  and  Architectures)  covers  Network 
Architecture  and  Applications  designed  to  improve  throughput,  including 
infrastructure,  protocols,  hardware  and  applications. 

Level  1 

IMPROVED  THROUGHPUT  FROM  PHYSICAL  LAYER  ADVANCES 

Reduction of multiple access interference through suppression in the spatial domain
and in the code domain for CDMA systems, and physical approaches to
reduce fading losses and restore signals degraded at the
physical layer.

IMPROVED THROUGHPUT FROM PROTOCOLS AND
ARCHITECTURES

Protocols  for  mobile  wireless  voice  and  data  traffic  ATM  networks  to  avoid 
packet  collisions,  reduce  call  dropping,  reduce  delays,  and  increase 
throughput  while  maintaining  Quality  of  Service. 

At the next categorization level, four categories can be discerned. These categories
(capitalized), and their key themes (bulleted), are as follows:

Level  2 

SIGNAL  QUALITY  MAINTENANCE 

Signal  degradation  at  the  physical  layer,  and  use  of  electronics  technology  to 
restore  and  maintain  signal  quality. 

•  Optical  wave  reflections  off  walls  and  columns  inside  buildings,  and 
propagation  measurements 

•  Antennas,  especially  micro-strip  patch  and  other  arrays,  and  simulated/ 
measured  beam  radiation  patterns  for  reduced  electromagnetic  interference 

• Power amplifiers in the GHz frequency range for wireless LANs, especially
those  using  CMOS  hardware  or  circuitry 

CDMA  INTERFERENCE  REDUCTION 

Reduction of multiple access interference through suppression in the spatial domain
and in the code domain for CDMA systems, and physical approaches to
reduce fading losses

• OFDM (Orthogonal Frequency-Division Multiplexing), especially degradation
from frequency offsets, and symbol modification to cut down on ISI
(Inter-Symbol Interference) while benefiting from reduced path loss

• Use of DS-CDMA (Direct-Sequence Code-Division Multiple Access) systems
that improve BER (Bit Error Rate) within Rayleigh multi-path fading channels

• Diversity, by the use of antenna diversity reception and coherent Rake
combiners, in spread-spectrum communications over fast-fading multi-path
channels  for  increasing  throughput 

•  Channel  estimation  for  interference-limited  channels,  and  multi-user  detectors 
for  co-channel  interference  cancellation 

•  CDMA  (Code-Division  Multiple  Access)  to  increase  system  capacity  by 
reducing  interference  from  other  users  in  a  cell 

•  Use  of  signal-to-interference  power  ratio  (SIR)  measurement  based  fast  transmit 
power  control  (TPC)  to  reduce  the  minimum  transmit  power  according  to  the 
channel  load  and  the  changes  in  the  link  conditions  due  to  fading 

PACKET  TRAFFIC  PROTOCOLS 

Access protocols for voice and data traffic that increase throughput and
reduce delays by packet collision avoidance (e.g., CSMA).

• TCP (Transmission Control Protocol) employing ARQ/FEC (Automatic
Repeat Request/Forward Error Correction) protocols, to reduce packet errors

•  MAC  (Medium  Access  Control)  Protocols,  especially  TDMA-based,  that 
efficiently  accommodate  real-time  voice  and  video  traffic  with  dynamic 
bandwidth/  resource  allocation 

•  Access  protocols  for  integrated  voice  and  data  traffic  that  overcome  the  heavy 
load  degraded  performance  due  to  the  contention  mechanism  of  random  access 
in  PRMA  (Packet  Reservation  Multiple  Access)  protocols. 

• CSMA (Carrier-Sense Multiple Access) protocols that increase throughput by
focusing  on  packet  collision  avoidance 

NETWORK  ARCHITECTURES 

Architectures and routing protocols for mobile ad hoc wireless ATM
networks to improve Quality of Service.

• Standard IEEE 802.11 MAC (Medium Access Control) protocols for wireless
LANs

• Standard Bluetooth IEEE 802.11 wireless LAN devices and technology

•  Providing  Internet  services,  with  enhanced  performance  Internet  protocols  over 
wireless  networks,  to  mobile  communications  systems  users 

•  Architectures  maintaining  guaranteed  levels  of  QoS  (Quality  of  Service)  within 
wireless  mobile  ATM  (Asynchronous  Transfer  Mode)  networks 

•  Handoffs  and  connection  path  rerouting  to  avoid  cell  loss  to  mobile  users  in 
ATM  (Asynchronous  Transfer  Mode)  networks 

•  Call  handoff  management  schemes  to  minimize  dropping  probability  of  handoff 
calls  and  blocking  probability  of  new  calls 

•  Routing  protocols  for  wireless  ad  hoc  mobile  networks 

•  Multicasting,  especially  in  mobile  ad  hoc  networks  with  multiple  hosts,  and  the 
use  of  multicast  delivery  trees  under  mobile  host  conditions 

Approximately  46%  of  the  CLUTO  cluster  documents  are  assigned  by  the 
partitioning  clustering  algorithm  to  the  first  higher  level  category,  Improved 
Throughput from Physical Layer Advances, while the other 54% are assigned to
the  second  higher  level  category,  Improved  Throughput  from  Protocols  and 
Architectures.  The  category  Improved  Throughput  from  Physical  Layer  Advances 
breaks  into  two  second  level  categories,  with  30%  of  its  documents  assigned  to  the 
category  Signal  Quality  Maintenance,  and  70%  assigned  to  category  CDMA 
Interference  Reduction.  The  other  first  level  cluster,  Improved  Throughput  from 
Protocols  and  Architectures,  also  divides  into  two  second  level  groups,  with  36% 
of  its  documents  assigned  to  category  Packet  Traffic  Protocols,  and  64%  of  its 
records  assigned  to  category  Network  Architectures. 

Finally,  Appendix  14  contains  the  Keywords  for  Wireless  LANs  documents 
retrieved  from  1991-2005.  The  Keywords  can  be  viewed  as  specific  technology 
thrust areas, with specificity increasing as frequency decreases.

4. SUMMARY AND CONCLUSIONS

(See Executive Summary at Beginning of Report)

APPENDIX 1 - GREEDY STRING TILING (GST) CLUSTERING


Greedy  String  Tiling  (GST)  clustering  is  a  method  of  grouping  text  or  text 
character  documents  (files)  by  similarity.  All  documents  to  be  grouped  are  placed 
in  a  database.  Each  pair  of  documents  is  compared  by  GST,  an  algorithm 
originally  used  to  detect  plagiarism  [Wise,  1992;  Prechelt  et  al,  2002],  and  a 
similarity  score  is  assigned  to  the  pair.  Then,  hierarchical  aggregation  clustering 
[Rasmussen,  1992;  Steinbach,  2000]  is  performed  on  all  the  documents,  using  the 
similarity  score  for  group  assignment. 

Greedy  String  Tiling  computes  the  similarity  of  a  pair  of  documents  in  two  phases. 
First,  all  documents  to  be  compared  are  parsed,  and  converted  into  token  strings 
(words  or  characters).  Second,  these  token  strings  are  compared  in  pairs  for 
determining  the  similarity  of  each  pair.  During  each  comparison,  the  GST 
algorithm  attempts  to  cover  one  token  string  (document)  with  sub-strings  (‘tiles’) 
taken  from  the  other  string.  These  sub-strings  are  not  allowed  to  overlap,  resulting 
in  a  one  to  one  mapping  of  tokens.  The  attribute  ‘greedy’  stems  from  the  fact  that 
the  algorithm  matches  the  longest  sub-strings  first. 
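
The two-phase procedure above can be sketched in a few lines of Python. This is a simplified, quadratic-time illustration written for this appendix, not the implementation used in the study; the function name, the `min_match` parameter (the shortest shared string accepted as a tile), and the sample sentences are all assumptions.

```python
def greedy_string_tiling(a, b, min_match=2):
    """Return non-overlapping tiles (start_a, start_b, length) shared by token lists a and b."""
    marked_a = [False] * len(a)
    marked_b = [False] * len(b)
    tiles = []
    while True:
        best_len = min_match - 1
        matches = []
        for i in range(len(a)):
            for j in range(len(b)):
                k = 0
                # Extend the match while tokens agree and are not already tiled.
                while (i + k < len(a) and j + k < len(b)
                       and a[i + k] == b[j + k]
                       and not marked_a[i + k] and not marked_b[j + k]):
                    k += 1
                if k > best_len:
                    best_len, matches = k, [(i, j, k)]
                elif k == best_len and k >= min_match:
                    matches.append((i, j, k))
        if not matches:
            break
        for i, j, k in matches:
            # Skip candidates that overlap a tile placed earlier in this pass.
            if not any(marked_a[i:i + k]) and not any(marked_b[j:j + k]):
                marked_a[i:i + k] = [True] * k
                marked_b[j:j + k] = [True] * k
                tiles.append((i, j, k))
    return tiles

# Phase 1: convert each document into a token string (here, words).
doc_a = "the throughput of slotted aloha with delay capture".split()
doc_b = "delay capture improves the throughput of slotted aloha".split()
# Phase 2: tile one token string with sub-strings of the other, longest first.
tiles = greedy_string_tiling(doc_a, doc_b)
print(tiles)
```

Each pass finds the longest remaining shared sub-string and marks its tokens, so no token is ever covered twice; this is the one to one mapping noted above.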

A  number  of  similarity  metrics  can  be  defined  once  the  tiling  is  completed.  One 
similarity  metric  is  the  percentage  of  both  token  strings  that  is  covered.  Another 
similarity  metric  is  the  absolute  number  of  shared  tokens.  A  third  similarity  metric 
is the mutual information index. Depending on the purpose of the matching,
additional weightings can be applied to the similarity metric to increase the
ranking precision. For example, if plagiarism detection is one study objective,
additional weighting could be given to shared string length. All similarity
metrics have positive and negative features, and the choice of metric is
influenced by the study objectives and the structure of the database.
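
As an illustration of the first two metrics, the sketch below computes them
from the lengths of the tiles found for one document pair. The function names
are hypothetical, the normalization of the coverage metric by the combined
length of both token strings is one common convention assumed here, and the
length-weighted score (with its power parameter) is only an example of how
longer shared strings might be emphasized.

```python
def shared_token_count(tile_lengths):
    """Absolute number of tokens shared between the two documents."""
    return sum(tile_lengths)


def coverage_similarity(tile_lengths, len_a, len_b):
    """Fraction of both token strings covered by tiles, between 0 and 1."""
    if len_a + len_b == 0:
        return 0.0
    return 2.0 * sum(tile_lengths) / (len_a + len_b)


def length_weighted_score(tile_lengths, power=2.0):
    """Assumed weighting: long tiles count more than many short ones."""
    return sum(length ** power for length in tile_lengths)


# One tile of three tokens shared by two five-token documents.
print(coverage_similarity([3], 5, 5))
print(shared_token_count([3]))
```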

Once the document similarity matrix has been generated, myriad clustering
techniques can be used to produce a classification scheme (taxonomy). In the
present study, multi-link hierarchical aggregation was used. Three clustering
variants were implemented: single-link, average-link, and complete-link. The
extension to other clustering schemes is straightforward. The variants differ
in how the decision to merge two clusters is made. Single-link requires that
the similarity of at least one pair of documents, one from each cluster, be
higher than a certain threshold, while complete-link requires that the
similarity between all pairs of documents across the two clusters be higher
than the threshold. Average-link requires that the average pair-wise
similarity between the documents of the two clusters exceed the threshold. For
the present study, complete-link appeared to give good results, and was the
clustering method used.
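
The three merge rules can be stated compactly in code. The sketch below is
illustrative only: the function name should_merge, the dictionary layout of
the similarity scores, and the example values are assumptions and are not
taken from the software used in this study.

```python
def should_merge(cluster_a, cluster_b, sim, threshold, link="complete"):
    """Decide whether two clusters may be merged under a linkage rule.

    sim maps frozenset({doc_x, doc_y}) to the similarity of that pair.
    """
    pair_sims = [sim[frozenset((x, y))] for x in cluster_a for y in cluster_b]
    if link == "single":      # the best cross-cluster pair must pass
        return max(pair_sims) > threshold
    if link == "complete":    # every cross-cluster pair must pass
        return min(pair_sims) > threshold
    if link == "average":     # the mean over cross-cluster pairs must pass
        return sum(pair_sims) / len(pair_sims) > threshold
    raise ValueError("unknown linkage: " + link)


# Hypothetical similarity scores for four documents.
sim = {
    frozenset(("d1", "d3")): 0.9,
    frozenset(("d1", "d4")): 0.2,
    frozenset(("d2", "d3")): 0.4,
    frozenset(("d2", "d4")): 0.3,
}
for rule in ("single", "average", "complete"):
    print(rule, should_merge(["d1", "d2"], ["d3", "d4"], sim, 0.5, rule))
```

Complete-link is the most conservative of the three rules, since a single
dissimilar pair is enough to block a merge.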
Appendix 2 - CLUTO Clustering


CLUTO  [Karypis,  2002]  is  a  software  package  that  implements  various  algorithms 
for  clustering  low-  and  high-dimensional  datasets  and  for  analyzing  the 
characteristics  of  the  various  clusters.  CLUTO  implements  three  different  classes 
of  clustering  algorithms  that  can  operate  either  directly  in  the  object's  feature  space 
or  in  the  object's  similarity  space.  The  clustering  algorithms  provided  by  CLUTO 
are  based  on  the  partitional,  agglomerative,  and  graph-partitioning  paradigms. 
CLUTO's  partitional  and  agglomerative  algorithms  are  able  to  find  clusters  that  are 
primarily  globular,  whereas  its  graph-partitioning  and  some  of  its  agglomerative 
algorithms  are  capable  of  finding  transitive  clusters. 

In  this  study,  documents  were  clustered  using  the  partitional  clustering  algorithms 
provided  by  CLUTO.  Partitional  clustering  algorithms  find  the  clusters  by 
partitioning  the  entire  document  collection  into  a  predetermined  number  of  disjoint 
sets,  each  corresponding  to  a  single  cluster.  This  partitioning  is  achieved  by 
treating  the  clustering  process  as  an  optimization  procedure  that  tries  to  create  high 
quality  clusters  according  to  a  particular  function  that  reflects  the  underlying 
definition  of  the  “goodness”  of  the  clusters.  This  function  is  referred  to  as  the 
clustering  criterion  function.  CLUTO  implements  seven  such  criterion  functions 
that  measure  various  aspects  of  intra-cluster  similarity,  inter-cluster  dissimilarity, 
and  their  combinations,  and  have  been  shown  to  produce  high-quality  clusters  in 
low- and high-dimensional datasets [Zhao and Karypis, 2004].

CLUTO uses two different methods for computing the partitioning clustering
solution. The first method computes a k-way clustering solution via a sequence
of repeated bisections, whereas the second method computes the solution
directly (in a fashion similar to traditional K-means-based algorithms). These
methods are often referred to as repeated bisecting and direct k-way
clustering, respectively. CLUTO computes a direct k-way clustering as follows.
Initially, a set of k objects is selected from the datasets to act as the seeds
of the k clusters. Then, for each object, its similarity to these k seeds is
computed, and it is assigned to the cluster corresponding to its most similar
seed. This forms the initial k-way clustering. This clustering is then
repeatedly refined so that it optimizes a desired clustering criterion
function. This optimization is performed using a randomized incremental
optimization algorithm that is greedy in nature, has low computational
requirements, and produces high-quality solutions [Zhao and Karypis, 2004]. A
k-way partitioning via repeated bisections is obtained by recursively applying
the above algorithm to compute 2-way clustering (i.e., bisections). Initially,
the objects are partitioned into two clusters, then one of these clusters is
selected and is further bisected, and so on. This process continues k - 1
times, leading to k clusters. Each of these bisections is performed so that
the resulting two-way clustering solution optimizes a particular criterion
function.
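
The seed-assign-refine loop and the repeated-bisection wrapper can be sketched
as follows. This is not CLUTO code: the refinement step substitutes a simple
centroid re-assignment loop for CLUTO's randomized incremental optimizer, the
rule of always bisecting the largest cluster is an assumption, and the seed
choice (the first object and the object least similar to it) is a
deterministic stand-in chosen so that the example is reproducible.

```python
import math


def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def centroid(vectors):
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def direct_kway(vectors, seeds, rounds=10):
    """Assign each vector to its most similar seed, then refine the centers."""
    centers = [vectors[s] for s in seeds]
    labels = []
    for _ in range(rounds):
        new_labels = [max(range(len(centers)),
                          key=lambda c: cosine(v, centers[c]))
                      for v in vectors]
        if new_labels == labels:   # assignments are stable: stop refining
            break
        labels = new_labels
        for c in range(len(centers)):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centers[c] = centroid(members)
    return labels


def repeated_bisection(vectors, k):
    """Build k clusters through k - 1 successive two-way splits."""
    clusters = [list(range(len(vectors)))]
    while len(clusters) < k:
        clusters.sort(key=len)
        target = clusters.pop()    # assumed rule: bisect the largest cluster
        sub = [vectors[i] for i in target]
        far = min(range(len(sub)), key=lambda i: cosine(sub[0], sub[i]))
        labels = direct_kway(sub, [0, far])
        left = [i for i, lab in zip(target, labels) if lab == 0]
        right = [i for i, lab in zip(target, labels) if lab == 1]
        if not left or not right:  # cluster cannot be split any further
            clusters.append(target)
            break
        clusters += [left, right]
    return sorted(sorted(c) for c in clusters)


vectors = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0],
           [0.0, 1.0, 0.0], [0.0, 0.9, 0.1],
           [0.0, 0.0, 1.0], [0.1, 0.0, 0.9]]
print(repeated_bisection(vectors, 3))
```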

The actual documents were represented with the widely-used vector-space model.
The various terms present in the documents were used to define a
high-dimensional space and each document was considered to be a vector in that
space. However, unlike the traditional vector-space representation, which
relies entirely on single terms, all consecutive two- and three-word
combinations were taken into account, resulting in a representation that is
capable of capturing the phrases commonly occurring in the documents. In
addition, Porter's stemming algorithm was used to pre-process the various
terms of each document prior to obtaining their vector-space representation.
The weight of each dimension was computed using the TF-IDF model, in which
terms that occur many times within a document are given higher weight (TF) and
terms that occur across many documents are given lower weight (IDF) [Zhao and
Karypis, 2004]. The similarity between two documents was measured using the
cosine of the angle between their corresponding document vectors.
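
A minimal sketch of this representation is given below. It is an illustration
under stated assumptions rather than the CLUTO pre-processing itself: stemming
is omitted (a Porter stemmer would be applied to each word before the n-grams
are formed), the IDF weight is taken as log(N / document frequency), which is
one common variant, and all function names are hypothetical.

```python
import math
from collections import Counter


def ngram_terms(text, max_n=3):
    """Single words plus all consecutive two- and three-word combinations."""
    words = text.lower().split()
    return [" ".join(words[i:i + n])
            for n in range(1, max_n + 1)
            for i in range(len(words) - n + 1)]


def tfidf_vectors(docs, max_n=3):
    """Return one sparse {term: weight} vector per document."""
    term_lists = [ngram_terms(d, max_n) for d in docs]
    doc_freq = Counter(t for terms in term_lists for t in set(terms))
    n_docs = len(docs)
    return [{t: tf * math.log(n_docs / doc_freq[t])
             for t, tf in Counter(terms).items()}
            for terms in term_lists]


def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = (math.sqrt(sum(w * w for w in u.values()))
            * math.sqrt(sum(w * w for w in v.values())))
    return dot / norm if norm else 0.0


docs = ["wireless local area network",
        "wireless local area network protocol",
        "antenna diversity combining"]
vecs = tfidf_vectors(docs)
print(cosine(vecs[0], vecs[1]), cosine(vecs[0], vecs[2]))
```

The first two documents share several two- and three-word terms and so score
well above the unrelated third document, whose cosine with the first is zero.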
Appendix 3 - Word/Phrase Clustering Methodology

Phrase clustering is the grouping of word strings according to some similarity metric.
When  applied  to  technical  literature  (unstructured  technical  text),  it  produces 
groups  of  technical  thrusts.  The  phrase  members  of  each  thrust  grouping  are 
related  to  the  group’s  theme.  When  applied  to  a  bibliometric  list,  such  as  a  list  of 
technical  paper  authors,  it  produces  groups  of  bibliometric  quantities  (e.g., 
authors).  In  the  author  case,  the  authors  of  each  group  would  have  some  common 
similarity,  depending  on  the  metric  chosen.  For  example,  the  groups  could  be 
defined  by  people  who  publish  together,  or  people  who  work  at  the  same 
institution,  or  people  who  work  in  the  same  topical  area. 

In the present paper, two phrase clustering approaches are used. In both
approaches, text to be analyzed is assembled, either from unstructured free
text (e.g., reports, databases with free text fields, etc.), or from databases
with bibliometric fields (e.g., databases of journal papers with author name
fields, author institution fields, journal fields, etc.). The text to be
analyzed is then converted to phrases, with a frequency of occurrence
associated with each phrase. The physical locations of the phrases in the
source documents are retained, so that this co-occurrence information can be
exploited in the grouping process.

3A.  Statistical  Clustering 

The  statistical  clustering  approaches  start  with  the  phrase  frequency  and  location 
information,  then  group  phrases  based  on  their  co-occurrence  frequencies  and  other 
important  numerical  indicators.  In  previous  studies  by  the  first  author,  the 
statistical  clustering  has  used  two  software  packages  in  parallel.  In  the  present 
report,  as  discussed  in  the  main  body  of  the  text,  the  factor  matrix  approach  is 
performed  using  one  software  package  only,  but  the  multi-link  clustering  approach 
still  uses  both  packages.  Additionally,  the  two  approaches  are  performed  in 
tandem,  where  the  factor  matrix  filtering  can  be  exploited  for  clustering  phrase 
selection. 

The phrase frequency and location information is generated by TechOasis, a
software package from Search Technology. TechOasis uses Natural Language
Processing (NLP) to extract the phrases and their frequencies from the free
text. Not all phrases are extracted (typical of all NLP), and manual cleanup of
the extracted phrases is still required to eliminate the lower technical
content phrases. Also, unlike TextSlicer, which counts nested phrases multiple
times, TechOasis disallows multiple counting of nested phrases (i.e., METAL
MATRIX would be counted once as a double word phrase, but neither METAL nor
MATRIX would be counted as single word phrases). In previous studies, the
resulting phrase frequency and co-occurrence matrix information was then
exported from TechOasis to WINSTAT (an Excel add-in software package) for both
approaches (described here). Two types of output were generated by WINSTAT:
factor matrix and multi-link clustering. Each of these processes will now be
described.
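
The difference between the two counting conventions can be made concrete with
a small sketch. It does not reproduce the internals of TechOasis or
TextSlicer. The function count_phrases, its allow_nested flag, and the
three-entry phrase vocabulary are hypothetical and serve only to illustrate
the METAL MATRIX example.

```python
from collections import Counter


def count_phrases(tokens, vocabulary, allow_nested):
    """Count vocabulary phrases in a token list, longest phrase first.

    allow_nested=False: once a phrase matches, its words are consumed and
    shorter phrases inside it are not counted.
    allow_nested=True: every vocabulary phrase starting at each position
    is counted, including phrases nested inside longer ones.
    """
    counts = Counter()
    max_len = max(len(p.split()) for p in vocabulary)
    i = 0
    while i < len(tokens):
        longest = 0
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in vocabulary:
                counts[phrase] += 1
                longest = max(longest, n)
                if not allow_nested:
                    break
        # Without nesting, skip past the whole matched phrase.
        i += longest if (longest and not allow_nested) else 1
    return counts


tokens = "METAL MATRIX COMPOSITE".split()
vocabulary = {"METAL MATRIX", "METAL", "MATRIX"}
print(count_phrases(tokens, vocabulary, allow_nested=False))
print(count_phrases(tokens, vocabulary, allow_nested=True))
```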

3B.  Factor  Matrix  Clustering 

Factor  matrix  clustering  generates  a  correlation  matrix  from  the  frequency  and 
location  information  of  the  phrases.  It  then  generates  factors  that  are  composed  of 
all  the  phrases  in  the  correlation  matrix.  The  phrases  are  ordered  quantitatively  by 
their  correlation  with  each  other,  with  the  most  strongly  correlated  assigned  the 
highest  quantitative  values. 

Appendix  4  shows  an  author  factor  matrix.  The  grouping  metric  was  the  number 
of  co-authored  publications,  taking  into  account  the  total  number  of  publications  of 
each  author.  In  order  to  display  all  factors  on  one  page  width,  the  column  widths 
had  to  be  shrunk  drastically.  The  matrix  element  values  were  hidden,  but  the 
shadings  could  be  displayed,  and  they  represent  high  correlation  values  (factor 
loadings).  The  darker  the  shading,  the  higher  the  value. 

The  number  of  factors  generated  can  be  fully  discretionary  (e.g.,  select  a  ten  factor 
matrix,  with  no  floor  on  the  eigenvalues),  or  partially  discretionary  (e.g.,  set  an 
eigenvalue  floor  of  unity,  let  the  algorithm  generate  the  final  number  of  factors). 
Final  taxonomies  can  be  generated  by  manually  combining  factors  into  categories, 
or  specifying  the  number  of  categories  (factors)  desired  to  the  algorithm. 

The  author  factor  matrix  in  Appendix  4  was  generated  by  selecting  an  eigenvalue 
floor  of  unity.  In  practice,  this  means  that  each  factor  generated  will  add  new 
information.  Analysis  of  the  author  factor  matrix  in  Appendix  4  is  explained  in  the 
author  bibliometrics  section  of  the  text,  and  will  not  be  repeated  here. 
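
The eigenvalue floor can be illustrated numerically. The eigenvalues of a
correlation matrix sum to the number of variables, so a factor with an
eigenvalue of at least unity accounts for at least as much variance as one
original variable. The sketch below is an assumption-laden illustration and
not the WINSTAT procedure: it uses a plain Jacobi rotation method for the
eigenvalues, and the function names and the 4 x 4 example matrix are
hypothetical.

```python
import math


def symmetric_eigenvalues(matrix, sweeps=50, tol=1e-12):
    """Eigenvalues of a symmetric matrix by cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]
    n = len(a)
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:              # matrix is numerically diagonal
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                diff = a[q][q] - a[p][p]
                # Rotation angle that zeroes the (p, q) element.
                theta = (math.pi / 4 if diff == 0
                         else 0.5 * math.atan(2 * a[p][q] / diff))
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):     # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):     # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def retained_factor_count(corr, floor=1.0):
    """Number of factors whose eigenvalue meets the floor."""
    return sum(1 for ev in symmetric_eigenvalues(corr) if ev >= floor)


# Hypothetical correlations: two pairs of strongly related authors.
corr = [[1.0, 0.8, 0.0, 0.0],
        [0.8, 1.0, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.6],
        [0.0, 0.0, 0.6, 1.0]]
print(symmetric_eigenvalues(corr))
print(retained_factor_count(corr))
```

In this example two eigenvalues exceed unity, one for each correlated pair, so
an eigenvalue floor of unity would retain two factors.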

A  phrase  factor  matrix  looks  identical  to  the  author  factor  matrix,  with  the 
exception  that  the  author  names  are  replaced  by  technical  phrases  extracted  from 
the  free  text  field  analyzed.  Appendix  7  contains  a  word  factor  matrix,  and 
Appendix  8  contains  a  phrase  factor  matrix. 
3C. Multi-Link Clustering

Phrase  multi-link  clustering  starts  with  pre-defined  metrics  (variables),  and  groups 
phrases  (cases)  based  on  the  strength  of  their  relationships  with  these  metrics.  For 
example,  if  a  symmetric  phrase-phrase  matrix  is  used  as  the  basis  for  clustering, 
then  the  closeness  of  two  phrases  (cases)  will  be  determined  by  their  co-occurrence 
profile  with  all  other  phrases  (variables).  The  clustering  becomes  a  co-occurrence 
profile  matching  process.  The  WINSTAT  software  package  produces  three  types 
of  related  outputs. 

1)  A  dendrogram  that  shows  the  quantitative  linkages  among  closely-related 
phrases.  Figure  3C-1,  for  example,  is  a  dendrogram  that  portrays  linkages 
among  the  twenty  highest  frequency  technical  content  phrases  from  a  database 
of  Electrochemistry  paper  Abstracts. 

A  dendrogram  is  a  tree-like  structure  that  shows  linkages  among  phrases.  It 
does  so  by  starting  with  a  root  that  encompasses  all  the  phrases  (See  the  vertical 
line  on  Figure  3C-1  ranging  from  a  Distance  value  of  42  to  slightly  over  40). 
It then splits repeatedly into two groups (clusters) until all the phrases are contained in
their  own  cluster.  In  Figure  3C-1,  the  root  at  the  bottom  of  the  page 
encompasses  all  the  phrases.  The  first  split  is  into  two  large  clusters.  One 
cluster  contains  the  phrases  ELECTROLYTE,  CELL,  CATHODE,  ANODE, 
and  CELLS.  The  second  cluster  contains  all  the  remaining  phrases  LITHIUM, 
ELECTROCHEMICAL  PROPERTIES,  CYCLIC  VOLTAMMETRY,  X-RAY 
DIFFRACTION,  ELECTRODES,  ELECTRODE,  HYDROGEN,  ALLOY, 
ALLOYS,  BATTERY,  AIR,  OXYGEN,  OXIDATION,  WATER,  and 
CONDUCTIVITY. 
FIGURE 3C-1 - EXAMPLE TWENTY PHRASE DENDROGRAM


2)  A  table  that  contains  a  quantitative  measure  of  the  similarity  of  adjoining 
phrases  or  phrase-cluster  pairs.  The  similarity,  or  ‘distance’,  is  obtained  by 
matching  the  profiles  as  described  above.  Figure  3C-2,  for  example,  is  a  table 
that  contains  the  information  portrayed  in  Figure  3C-1.  The  distances  shown  on 
the  dendrogram  are  taken  from  the  distances  given  in  this  table;  thus,  the  table 
is  the  numerical  expression  of  the  dendrogram. 

FIGURE  3C-2  -  ELEMENTAL  STEPS  IN  DENDROGRAM  FORMATION 


Joining Cluster 1           Size 1  With Cluster 2              Size 2  Distance
Cathode                     1       anode                       1       32.83216927
discharge                   1       charge                      1       34.16843369
Cathode                     2       current density             1       37.9232786
electrodes                  1       electrode                   1       38.29297045
electrochemical properties  1       discharge capacity          1       38.68748862
lithium                     1       Li                          1       39.00662295
discharge                   2       battery                     1       39.06016967
electrolyte                 1       Cathode                     3       39.33157288
hydrogen                    1       reduction                   1       39.42153664
Ni                          1       oxidation                   1       39.52735747
cyclic voltammetry          1       X-ray diffraction           1       39.58094847
electrochemical properties  2       cyclic voltammetry          2       39.85159646
hydrogen                    2       room temperature            1       39.86541946
hydrogen                    3       Ni                          2       39.97396716
electrodes                  2       discharge                   3       39.97805457
lithium                     2       hydrogen                    5       40.00961352
lithium                     7       electrochemical properties  4       40.04779843
lithium                     11      electrodes                  5       40.17291601
electrolyte                 4       lithium                     16      40.26135359

3)  A  taxonomy  of  a  pre-specified  number  of  groups  of  phrases.  Figure  3C-3,  for 
example,  shows  the  groupings  of  phrases  when  four  clusters  were  specified  for 
the  data  portrayed  in  Figure  3C-1. 

FIGURE  3C-3 


Cluster #  Phrases
1          electrolyte
2          lithium
3          electrodes
3          electrode
2          Li
1          Cathode
2          hydrogen
2          room temperature
4          electrochemical properties
4          cyclic voltammetry
2          reduction
1          anode
3          discharge
4          X-ray diffraction
2          Ni
4          discharge capacity
3          battery
3          charge
2          oxidation
1          current density

Final  categories  can  be  generated  two  ways.  The  clusters  can  be  manually  grouped 
into  categories.  Or,  the  number  of  categories  (clusters)  can  be  specified  to  the 
algorithm,  and  the  computer  output  will  contain  the  final  categories.  Usually, 
some  modest  manual  re-grouping  of  the  computer  output  groups  is  required  to 
arrive  at  a  final  recommended  taxonomy. 
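
The relationship between the merge table (Figure 3C-2) and a taxonomy with a
pre-specified number of groups (Figure 3C-3) can be illustrated by replaying
the merges in order of increasing distance and stopping when the desired
number of clusters remains. The sketch below is not the WINSTAT procedure. It
assumes that each table row names a cluster by one of its members, and the
function name cut_into_clusters is hypothetical. The merge pairs are taken
from Figure 3C-2.

```python
# Merge pairs from Figure 3C-2, in order of increasing distance.
MERGES = [
    ("Cathode", "anode"),
    ("discharge", "charge"),
    ("Cathode", "current density"),
    ("electrodes", "electrode"),
    ("electrochemical properties", "discharge capacity"),
    ("lithium", "Li"),
    ("discharge", "battery"),
    ("electrolyte", "Cathode"),
    ("hydrogen", "reduction"),
    ("Ni", "oxidation"),
    ("cyclic voltammetry", "X-ray diffraction"),
    ("electrochemical properties", "cyclic voltammetry"),
    ("hydrogen", "room temperature"),
    ("hydrogen", "Ni"),
    ("electrodes", "discharge"),
    ("lithium", "hydrogen"),
    ("lithium", "electrochemical properties"),
    ("lithium", "electrodes"),
    ("electrolyte", "lithium"),
]


def cut_into_clusters(merges, n_clusters):
    """Replay merges until n_clusters groups remain (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]   # path compression
            x = parent[x]
        return x

    for a, b in merges:                     # register every phrase
        find(a)
        find(b)
    n_items = len(parent)
    # Each merge reduces the cluster count by one.
    for a, b in merges[:n_items - n_clusters]:
        parent[find(b)] = find(a)
    groups = {}
    for item in parent:
        groups.setdefault(find(item), []).append(item)
    return sorted(sorted(g) for g in groups.values())


for cluster in cut_into_clusters(MERGES, 4):
    print(cluster)
```

With four clusters requested, the first sixteen merges are applied, and the
resulting groups are the same four phrase groups listed in Figure 3C-3.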
Appendix 4 - Author Factor Matrix




JU-MC 

YOU-YH 

PARK-CH 

PAIK-JH 

CHO-JW 

SEKI--K 

SANADA-Y 

WANG--Q 

BHARGAVA-VK 

NAKAGAWA-M 

KIM--DI 

TANAKA-Y 

ROITHMEIER-A 

BEHBAHANI-F 

ABIDI-AA 

LEETE-JC 

SAMUELI-H 

KAVEHRAD-M 

HIGUCHI-K 

SAWAHASHI-M 

OKAWA-K 

FUKUMOTO-S 

ADACHI-F 

ANDOH-H 

NAKASE-H 

TSUBOUCHI-K

SUDA-H 

KATAYAMA-M 

OGAWA-A 

YAMAZATO-T 

JAMALIPOUR-A 

KIM-J 

OHTA-A 

SUGIYAMA-T 

UMEHIRA-M 

MANABE-T 

AIKAWA-S 

SUZUKI-H

KUBOTA-S 

SATO-H 

HASHIMOTO-A 

SHINAGAWA-N 

NAKANO-K 

SENGOKU-M 

KOBAYASHI-T 

MORIKURA-M 

TAKANASHI-H 

IIZUKA-M 

MIZOGUCHI-M

GIANNETTI-F 

LUISE-M

DE  GAUDENZI-R 

DEGAUDENZI-R 

CORAZZA-GE 

LENZINI-L 

EDWARDS-DJ 

O'BRIEN-DC

SAMARAS-K 

THOMPSON-JS 

MULGREW-B 

GRANT-PM 

CHIASSERINI-CF

FUMAGALLI-A 

MARSAN-MA 

CHEN-JC 

AGRAWAL-P 

SIVALINGAM-KM 

CALDERBANK-AR 

TAROKH-V 

SESHADRI-N 

PAULRAJ-A 

TANAKA-S 

IHARA-T 

HARADA-A 

MUKUMOTO-K 

FUKUDA-A 

WU-G 
FANTACCI-R
GIAMBENE-G 
DEL  RE-E 
BENELLI-G 
ZORZI-M 

CHOCKALINGAM-A 

RAO-RR 

VENKATARAM-P 

MILSTEIN-LB 

LAMAIRE-RO 

ACAMPORA-AS 

HASE-Y 

INOUE-M 

XYLOMENOS-G 

POLYZOS-GC 

MAHONEN-P 

MORINAGA-N 

SAMPEI-S 

ABETA-S 

GEMELOS-SM 

SHRIKHANDE-K 

ARNOLD-JF 

FRATER-MR 

KWOK-YK 

LAU-VKN 

AGUSTI-R 

SALLENT-O 

CRYAN-RA 

ELMIRGHANI-JMH 

LAU-FCM 

TAM-WM 

JEONG-DG 

JEON-WS 

KIM-D 

NIX-A 

BULL-D 

SOLLENBERGER-NR 

CIMINI-LJ 

LI-Y

CHUANG-JCI

WINTERS-JH 

CHANG-LF 

FALCONER-DD 

LOPEZ-HERNANDEZ-FJ 

PEREZ-JIMENEZ-R 

MARK-JW 

SHEN-XM 

MCGEEHAN-JP 

NORTON-GH 

NIX--AR 

SUN--Y 

HASEGAWA-T 

TAKAHASHI-Y 

MATSUMOTO-Y 

OKUMURA-Y 

DOHI-T 

CONTI-M

GREGORI-E 

TAFAZOLLI-R 

EVANS-BG 

OKADA-T 

SHIRATO-T 

TSENG-YC 

SHEU-JP 

SONG-YJ 

LEE--S 

CHO-DH 

KIM-K 

LEE-JY 
CHEN-HH

SASASE-I

OHTSUKI-T

TOH-CK

KIM-H

TOMBA-L

BENVENUTO-N 

HAARDT-M 

LIN-CR 

WALKE-B 

HOPPER-A 

LEE--JH 

KIM--Y 

HERRMANN-C 

BROWNE-J 

KIM-JY 

KOHNO-R 

ONOZATO-Y 

LYAKHOV-AI 

LEUNG-VCM 

MANDAYAM-NB 

CAMPBELL-AT 

PRASAD-R 

PAHLAVAN-K 

MERAKOS-L 

GARCIA-LUNA-ACEVES-JJ

COX-DC 

CHANG-PR 

KELLER-D 

HAAS-ZJ 

CHLAMTAC-I 

MATSUMOTO-T 

AGHVAMI-AH 

KAHN-JM 

WOLISZ-A

CHEN-KC 

LEE-ML 

MOH-WM 

SINGH-S 

GUPTA-S 

JOHNSON-DB 

SUNG-DK 

KANG-CG 

SHEU-ST 

GLISIC-S 

STAVRAKAKIS-I 

KAMERMAN-A 

VEERARAGHAVAN-M 

FEHER-K 

GOLDBERG-L 

BOLSENS-I

OHSAWA-T 

HOLTZMAN-JM 

MORGERA-SD 

HANLY-SV 

UN-CK 

KIM-JH 

HANZO-L 

HEFTMAN-G 

LINNARTZ-JPMG 

LIN--SY 

LEE--TH 

BEN  LETAIEF-K 

CAO--ZG 

AKYILDIZ-IF 

JOE-I

LI-J

RAYCHAUDHURI-D 

GIANNAKIS-GB 

BISWAS-SK 

CHOI-S 

SHIN-KG 

LI-F

WU-Q 

MIZUNO-T 

SUZUKI-T

TASAKA-S 

FUJII-T 

ARIYAVISITAKUL-S
GREENSTEIN-LJ 
CHUA-KC 
LYE-KM 
Appendix 5 - Prolific Author Dendrogram

Appendix 6 - Country Co-Occurrence Matrix - 1991 - mid-2002

Appendix 7 - Word Factor Matrix
Factor  1  2  3  4  5  6  7  8  9  10  11  12

arq  [illegible]  0.004  0.072  0.109  0.021  0.054  0.08  0.006  0.033  0.001  0.011  0.004
error  0.434  0.075  0.095  0.155  0.008  0.058  0.016  0.173  0.119  0.119  0.024  0.276
automatic  [illegible]  0.003  0.048  0.102  0.03  0.004  0.089  0.002  0.035  0.026  0.027  0.008
Correction  0.057  0.033  0.071  0.028  0.034  0.013  0.003  0.05  0.006  0.072  0.123
retransmission  0.079  0.078  0.082  0.023  0.046  0.079  0  0.065  0.049  0.009  0.012
request  0.132  0.059  0.102  0.06  0.037  0.001  0.029  0.012  0.007  0.049  0.006
bit  0.324  0.03  0.045  0.082  0.078  0.007  0.079  0.117  0.128  0.037  0.067  0.247
data  [illegible]  0.239  0.188  0.065  0.092  0.042  0.016  0.015  0.003  0.037  0.051  0.077
link  [illegible]  0.081  0.019  0.047  0  0.031  0.137  0.283  0.013  0.064  0.113  0.09
rate  [illegible]  0.033  0.029  0.107  0.115  0.045  0.135  0.133  0.096  0.047  0.053  0.254
transport  [illegible]  0.031  0.132  0.041  0.013  0.027  0.135  0.09  0.089  0.051  0.064  0.041
throughput  0.299  0.088  0.106  0.067  0.067  0.097  0.02  0.032  0.184  0.086  0.035
errors  0.292  0.037  0.072  0.024  0.021  0.039  0.013  0.025  0.026  0.033  0.001  0.055
TCP  0.289  0.067  0.004  0.024  0.002  0.057  0.023  0.041  0.136  0.026  0.099  0.1
forward  0.285  0.028  0.011  0.074  0.018  0.023  0.049  0.184  0.029  0.01  0.151  0.034
layer  0.282  0.038  0.142  0.038  0.055  0.053  0.041  0.009  0.05  0.068  0.134  0.077
links  0.268  0.084  0.049  0.021  0.005  0.042  0.047  0.043  0.123  0.001  0.033  0.035
coding  0.266  0.039  0.026  0.049  0.053  0.007  0.016  0.137  0.041  0.041  0.053  0.203
transmission  0.266  0.22  0.034  0.039  0.015  0.034  0.001  0.088  0.022  0.005  0.058  0.134
packet  0.258  0.033  0.09  0.034  0.082  0.033  0.009  0.164  0.095  0.024  0.022
transfer  0.256  0.045  0.069  0.018  0.091  0.022  0.289  0.161  0.038  0.065  0.051  0.006
fading  0.255  0.148  0.105  0.242  0.095  0.111  0.058  0.088  0.177  0.103  0.29
video  0.245  0.107  0.151  0.007  0.028  0.007  0.104  0.089  0.068  0.073  0.015  0.017
congestion  0.231  0.009  0.058  0.004  0.005  0.017  0.125  0.072  0.115  0.039  0.002  0.101
loss  0.22  0.032  0.054  0.014  0.024  0.02  0.188  0.027  0.007  0.205  0.011  0.052
ATM  0.218  0.007  0.151  0.075  0.131  0.078  0.244  0.062  0.067  0.077  0.018
control  0.215  0.284  0.001  0.082  0.015  0.027  0.246  0.115  0.043  0.06  0.149  0.099
unit  0.212  0.015  0.007  0.063  0.031  0.085  0.055  0.031  0.017  0.029  0.051  0.097
BER  0.21  0.096  0.093  0.119  0.018  0.011  0.061  0.097  0.002  0.035  0.199
reliable  0.204  0.038  0.018  0.067  0.008  0.03  0.017  0.051  0.116  0.018  0.073  0.002
selective  0.204  0.064  0.054  0.082  0.029  0.033  0.076  0.104  0.035  0.027  0.035  0.169
channel  0.203  0.197  0.099  0.222  0.082  0.064  0.001  0.164  0.083  0.169  0.112  0.224
wireless  0.201  0.145  0.275  0.075  0.002  0.083  0.091  0.182  0.085  0.011  0.221  0.01
MAC  0.041  0.46  0.043  0.033  0.05  0.054  0.01  0.031  0.011  0.016  0.158  0.043
collision  0.094  0.444  0.068  0.053  0.035  0.047  0.136  0.024  0.035  0.073  0.102  0.024
access  0.036  0.441  0.235  0.026  0.007  0.104  0.009  0.085  0.116  0.059  0.235  0.108
protocol  0.178  0.429  0.072  0.188  0.098  0.14  0.022  0.106  0.268  0.008  0.13  0.091
traffic  0.107  0.413  0.01  0.103  0.023  0.077  0.281  0.147  0.06  0.055  0.108  0.07
contention  0.019  0.411  0.016  0.011  0.01  0.027  0.024  0.02  0.023  0.029  0.053  0.003
medium  0.04  0.365  0.086  0.048  0.039  0.039  0.015  0.015  0.006  0.001  0.104  0.015
reservation  0.014  0.352  0.024  0.087  0.071  0.056  0.113  0.103  0.045  0.022  0.075  0.012
priority  0.029  0.321  0.063  0.067  0.059  0.044  0.111  0.088  0.003  0.022  0.063  0.009
slots  0.012  0.313  0.054  0.047  0.063  0.021  0.073  0.057  0.009  0.062  0.073  0.036
IEEE  0.035  0.295  0.147  0.023  0.017  0.026  0.135  0.041  0.009  0.06  0.011
load  0.05  0.291  0.053  0.01  0.007  0.024  0.087  0.051  0.05  0.053  0.061  0.043
multiple  0.079  0.289  0.031  0.08  0.001  0.049  0.016  0.153  0.1  0.115  0.196
slot  0.007  0.288  0.045  0.019  0.085  0.009  0.031  0.064  0.016  0.029  0.025  0.039
delay  0.14  0.285  0.132  0.011  0.071  0.058  0.115  0.025  0.017  0.182  0.07  0.07
voice  0.089  0.271  0.139  0.115  0.113  0.044  0.103  0.122  0.065  0.003  0.123  0.031
packets  0.178  0.263  0.108  0.084  0.056  0.042  0.063  0.005  0.243  0.022  0.005  0.001
sense  0.098  0.257  0.002  0.065  0.022  0.038  0.123  0.052  0.007  0.127  0.067  0.057
802.11  0.02  0.257  0.098  0.005  0.002  0.014  0.112  0.035  0.008  0.056  0.291  0.022
time  0.042  0.249  0.015  0.159  0.009  0.007  0.078  0.009  0.073  0.155  0.045  0.034
CSMA/CD  0.112  0.245  0.1  0.066  0.082  0.046  0.135  0.005  0.041  0.12  0.169  0.055
ALOHA  0.014  0.241  0.086  0.077  0.041  0.071  0.081  0.007  0.021  0.143  0.005  0.001
station  0.066  0.238  0.019  0.048  0.013  0.022  0.055  0.179  0.038  0.076  0.265  0.235
real-time  0.059  0.236  0.041  0.019  0.044  0.022  0.098  0.071  0.036  0.008  0.039  0.04
slotted  0.016  0.232  0.061  0.073  0.005  0.055  0.09  0.003  0.058  0.142  0.019  0.014
frame  0.135  0.223  0.048  0.001  0.003  0.026  0.01  0.001  0.036
Appendix 9 - Word Dendrogram

Appendix 10 - Phrase Dendrogram

APPENDIX 11 - GREEDY STRING TILING ELEMENTAL CLUSTERS

(93)  Cluster  1  (cdma,  access,  multiple,  system,  code,  division,  mobile,  systems, 
wireless)  -  focuses  on  CDMA  (Code-Division  Multiple  Access)  Schemes  for 
mobile  wireless  systems 

(62) Cluster 2 (atm, wireless, network, mobile, mobility, service, services,
qos, protocol) - focuses on ATM (Asynchronous Transfer Mode) within wireless
networks to guarantee QoS (Quality of Service) support at all times.

(43) Cluster 3 (cdma, code, access, multiple, system, mobile, division,
sequence, error, direct, wireless, communications, multipath, fading, receiver,
systems, channel, signal, bit, communication, interference) - focuses on an
interference cancellation receiver for applications in DS/CDMA (Direct-Sequence
Code-Division Multiple Access) uplink communications.

(42) Cluster 4 (access, wireless, mac, protocol, network, atm, control,
traffic, channel, based, paper, networks, scheme) - focuses on MAC (Medium
Access Control) protocols of wireless ATM (Asynchronous Transfer Mode) networks
to improve bit rate.

(42) Cluster 5 (cdma, system, capacity, link, interference, mobile, cell,
access, multiple, cellular, systems, division, traffic, network, code, base,
user, power, station) - focuses on single and multiple-cell Code-Division
Multiple Access networks developed from traffic models to improve Quality of
Service support.

(39) Cluster 6 (wireless, atm, access, system, systems, services, networks,
broadband, based, network, protocol, mobile, multimedia, high) - focuses on
ATM (Asynchronous Transfer Mode)-based broadband networks to transmit data.

(38) Cluster 7 (cdma, interference, code, access, multiple, division, systems,
user, channel, signal, ds, mobile, time, channels, based, sequence, multipath)
- focuses on DS-CDMA (Direct-Sequence Code-Division Multiple Access) systems
in multipath fading channels.

(36) Cluster 8 (ad, hoc, network, networks, mobile, routing, wireless,
protocol, nodes, based, data, system, communication, new, cellular, hosts,
infrastructure) - focuses on the routing protocols and infrastructure within
standard ad-hoc wireless networks.

(35) Cluster 9 (power, control, system, capacity, cdma, mobile, interference,
base, users, access, cell, fading, station, cellular, multiple, link, systems)
- focuses on TPC (Transmit Power Control) based on signal-to-interference plus
background noise power ratio being applied under multipath fading environments.

(34) Cluster 10 (voice, data, protocol, access, wireless, traffic, packet,
network, channel, paper, delay, reservation, scheme, terminals, throughput,
stations) - focuses on a Medium Access Control protocol that integrates voice
and data services in wireless LANs.

(33) Cluster 11 (diversity, systems, antenna, interference, system, fading,
wireless, receiver, signal, multipath, multiple, rake, combining, antennas) -
focuses on antenna diversity to overcome multipath fading.

(32)  Cluster  12  (wireless,  area,  local,  network,  networks,  systems,  design, 
system,  communications,  frequency,  wLAN,  range,  propagation)  -  focuses  on 
multipath  propagation  problems  that  are  common  within  wireless  local  area 
networks. 

(30)  Cluster  13  (cdma,  receiver,  interference,  access,  antenna,  multiple, 
channel,  time,  signal,  code,  division,  adaptive,  error,  power,  systems,  users, 
rate)  -  focuses  on  W-CDMA  (Wideband  Direct  Sequence  Code  Division  Multiple 
Access)  schemes,  employing  an  adaptive  antenna  array  to  reduce  multiple  access 
interference. 

(30) Cluster 14 (power, channel, link, interference, fading, average, rake,
transmit, cdma, signal, pilot, diversity, ber) - focuses on the use of fast TPC
(Transmit Power Control) and coherent Rake combining to improve upon DS-CDMA
(Direct-Sequence Code-Division Multiple Access) schemes in increasing link
capacity.

(30)  Cluster  15  (rate,  system,  frequency,  interference,  error,  wireless, 
transmission,  channel,  bit,  method,  high,  fading,  multipath)  -  focuses  on  BER 
(Bit  Error  Rate)  performance  in  the  presence  of  frequency-selective  multipath 
fading  channels. 
(29) Cluster 16 (channel, wireless, error, control, data, based, scheme, packet,
protocol, access, protocols, layer, schemes) - focuses on exploiting the energy
between the MAC (Medium Access Control) layer and the physical layer of a
cellular wireless system. This is known as an adaptive error-control scheme.

(28)  Cluster  17  (handoff,  cell,  path,  network,  atm,  scheme,  mobile,  handover, 
rerouting,  wireless,  paper,  switch,  connection,  control)  -  focuses  on  handoff 
management  schemes  for  wireless  ATM  (Asynchronous  Transfer  Mode)  networks. 

(28)  Cluster  18  (protocol,  access,  channel,  traffic,  wireless,  mac,  multiple, 
station, base, mobile, reservation, control, based, data) - focuses on MAC
(Medium Access Control) protocols and their performance in improving data
transmission. 

(26)  Cluster  19  (csma,  cd,  network,  system,  channel,  protocol,  delay,  model, 
throughput,  packet,  persistent)  -  focuses  on  throughput,  packet  delay,  and 
packet loss rate in CSMA (Carrier-Sense Multiple Access) protocols.

(26)  Cluster  20  (access,  wireless,  radio,  data,  channel,  throughput,  system, 
network,  high,  packet,  service,  protocol,  control,  based,  signal,  rate)  -  focuses 
on  increasing  throughput  and  decreasing  packet  loss  within  wireless  data  and  radio 
transmissions. 

(26)  Cluster  21  (routing,  protocol,  networks,  ad,  hoc,  network,  route,  protocols, 
wireless,  nodes,  node,  mobile)  -  focuses  on  network  routing  protocols  within 
wireless  ad-hoc  networks. 

(26)  Cluster  22  (system,  control,  power,  cdma,  transmission,  wireless,  channel, 
rate,  based,  mobile,  scheme,  access,  code,  traffic,  multiple)  -  focuses  on  fast 
TPC (Transmit Power Control) for higher-quality data transmissions with an
improved  BER  (Bit  Error  Rate). 

(25)  Cluster  23  (network,  csma,  cd,  protocol,  system,  access,  simulation, 
collision, multiple, detection, carrier, sense, networks, local, area, throughput)
- focuses on CSMA/CD (Carrier Sense Multiple Access/Collision Detection)
protocols to increase throughput within wireless LANs.

(24)  Cluster  24  (ghz,  db,  noise,  v,  power,  frequency,  voltage,  chip,  low, 
technology,  supply,  cmos,  circuit,  wireless,  gain,  dbm,  phase)  -  focuses  on 
CMOS technology that mainly encompasses circuitry and chips.

(23) Cluster 25 (ieee, wireless, access, data, standard, protocol, mac, LAN,
mechanism, high, system, time, networks) - focuses on standard MAC (Medium
Access Control) protocols within IEEE 802.11 wireless networks.

(23)  Cluster  26  (system,  cdma,  mobile,  generation,  services,  systems,  data, 
radio,  station,  high,  technology,  rate,  wireless)  -  focuses  on  W-CDMA 
(Wideband Code-Division Multiple Access) radio transmission technologies,
offering  improved  voice  and  data  transmissions. 

(23)  Cluster  27  (wireless,  services,  mobile,  network,  networks,  service,  ip, 
internet,  data,  systems,  access,  new,  support,  applications)  -  focuses  on 
different  applications  of  and  support  for  wireless  mobile  networks. 

(22)  Cluster  28  (mobile,  ip,  internet,  wireless,  protocol,  network,  cellular, 
mobility,  commerce,  tcp,  layer,  new,  data,  multimedia)  -  focuses  on  mobile 
wireless  network  IP  (Internet  Protocol)  and  TCP  (Transmission  Control  Protocol) 
to  establish  a  connection  between  two  hosts  for  transmission  of  data. 

(22)  Cluster  29  (tcp,  wireless,  link,  throughput,  packet,  protocol,  network, 
mobile, end, congestion, loss, paper, networks, layer, links, transport) -
focuses on TCP (Transmission Control Protocol), which establishes a link between
a  destination  and  a  source  to  send  and  receive  packets.  Many  factors  including 
congestion  and  packet  loss  determine  the  throughput. 

(22)  Cluster  30  (voice,  data,  packet,  system,  protocol,  access,  delay,  time, 
traffic, multiple, wireless, packets, systems, integrated) - focuses on MAC
(Medium Access Control) protocols for integrated voice and data services in
wireless  local  networks. 

(22) Cluster 31 (network, ip, wireless, mobile, services, internet, protocol,
networks, qos, architecture, support, service, management, data, mobility) -
focuses on providing IP multicasting with QoS (Quality of Service) support over
not only wired networks but also wireless mobile networks.

(22)  Cluster  32  (wireless,  systems,  networks,  mobile,  access,  broadband,  atm, 
services,  technologies,  communications,  future,  network,  high)  -  focuses  on 
broadband technologies being employed by wireless mobile networks to increase
throughput.

(22) Cluster 33 (traffic, atm, network, voice, networks, mobile, rate, delay,
wireless, bit, access, bandwidth, paper, time) - focuses on the ATM (Asynchronous
Transfer Mode) technique in providing more efficient multimedia communications,
as the demand for them has continued to increase.

(21)  Cluster  34  (fading,  cdma,  channel,  diversity,  access,  multiple,  code,  error, 
mobile,  division,  direct,  sequence,  ds,  system,  probability,  combining,  scheme, 
shift,  satellite,  shadowing,  phase,  keying)  -  focuses  on  QPSK  (Quadrature  Phase 
Shift  Keying)  and  coherent  combining  in  CDMA  (Code-Division  Multiple  Access) 
systems  to  combat  multipath  Rayleigh  fading  channels. 

(20)  Cluster  35  (mobile,  network,  location,  wireless,  atm,  networks,  scheme, 
connection,  hlr,  systems,  management,  mobility)  -  focuses  on  HLR  (Home 
Location  Register),  which  stores  service  and  location  information  for  each  mobile 
registered  within  a  PCS  network. 

(20)  Cluster  36  (satellite,  mobile,  system,  cdma,  channel,  systems,  leo,  access, 
diversity,  link,  power,  capacity,  terrestrial,  low,  multiple,  communication, 
earth, orbit) - focuses on CDMA (Code-Division Multiple Access) protocols that
have  been  proposed  for  personal  communications  networks  for  both  terrestrial  and 
satellite  links. 

(20)  Cluster  37  (wireless,  network,  access,  networks,  based,  control,  system, 
protocol,  issues,  time,  mobile,  algorithm,  atm,  model,  data,  high,  spectrum, 
multiple,  error,  communications,  broadband)  -  focuses  on  standard  protocols 
within a wireless ATM (Asynchronous Transfer Mode) network and the issues that
these  protocols  deal  with  in  transferring  data. 

(20)  Cluster  38  (multicast,  mobile,  networks,  wireless,  network,  tree,  protocol, 
based,  routing,  group,  ad,  hoc,  multicasting,  source,  paper,  nodes)  -  focuses  on 
the  multicast  tree  problem  that  ad-hoc  networks  experience.  This  problem  deals 
with  two  nodes  within  the  network  trying  to  communicate  with  one  another. 

(19)  Cluster  39  (power,  control,  algorithm,  system,  interference,  signal,  cdma, 
sir,  transmission,  systems,  scheme,  wireless)  -  focuses  on  an  adaptive  SIR 
(Signal-to-Interference  Ratio)  based  power  control  mechanism  for  CDMA  (Code- 
Division  Multiple  Access)  systems  that  provides  better  outage  probabilities  and  a 
better response time.

(19) Cluster 40 (mobile, wireless, radio, systems, access, ip, system, network,
based,  generation,  services,  internet)  -  focuses  on  standard  IP  (Internet  Protocol) 
within  mobile  wireless  radio  networks. 

(18)  Cluster  41  (network,  mobile,  routing,  wireless,  algorithm,  networks, 
based,  time,  mobility,  cluster,  ad,  hoc,  topology)  -  focuses  on  the  routing  and 
topology  of  a  standard  wireless  ad-hoc  network. 

(18)  Cluster  42  (time,  real,  traffic,  access,  protocol,  tdma,  wireless,  multiple, 
communication, messages, network, channel, delay, protocols) - focuses on
TDMA (Time-Division Multiple Access) protocols as a means of efficient wireless
multimedia  communications. 

(18)  Cluster  43  (codes,  cdma,  spreading,  orthogonal,  code,  rate,  turbo, 
transmission, system, data, time, frequency, antennas) - focuses on a dynamic
spreading code selection technique to obtain a low peak-to-average power ratio
within an OFDM/CDMA (Orthogonal Frequency-Division Multiplexing/Code-
Division Multiple Access) system.

(17)  Cluster  44  (network,  atm,  mobile,  wireless,  based,  signalling,  architecture, 
networks,  protocol,  paper,  services,  management,  control)  -  focuses  on  the 
standard  architecture  within  a  typical  ATM  (Asynchronous  Transfer  Mode)  mobile 
wireless  network. 

(17)  Cluster  45  (protocol,  access,  throughput,  based,  code,  collision,  wireless, 
multiple,  delay,  time,  channel,  packets,  networks,  receiver,  data)  -  focuses  on 
the ability of MAC (Medium Access Control) protocols to increase throughput by
avoiding packet collisions while data is being transmitted through shared channels.

(16)  Cluster  46  (protocol,  csma,  cd,  token,  access,  network,  networks,  ring, 
control, time, scheme, data) - focuses on CSMA-CD (Carrier-Sense Multiple
Access-Collision Detection) and Token Ring for deadline scheduling in wireless
networks.

(16)  Cluster  47  (rate,  bit,  wireless,  protocol,  time,  atm,  paper,  data,  channel, 
access,  transmission,  error,  scheme,  delay,  techniques)  -  focuses  on  BER  (Bit 
Error Rate) within the transmission of wireless ATM (Asynchronous Transfer
Mode) networks.

(16) Cluster 48 (arq, scheme, throughput, packet, wireless, channel, error,
repeat, retransmission, type, transmission, rate) - focuses on the ARQ
(Automatic Repeat Request) protocol, which increases throughput and decreases
the number of transmitted fragmented packets by retransmitting them until they are
received correctly.

(15)  Cluster  49  (scheme,  fading,  channel,  channels,  wireless,  frequency,  error, 
equalizer,  systems,  paper,  equalization,  mode,  ip,  data,  new,  rate)  -  focuses  on 
DFE (Decision Feedback Equalization) schemes to mitigate the effects of multipath
fading channels.

(15)  Cluster  50  (qos,  time,  service,  policies,  wireless,  region,  call,  scheduling, 
traffic,  access,  admission,  system,  packet,  real,  bandwidth)  -  focuses  on 
providing  QoS  (Quality  of  Service)  for  mobile  wireless  networks  through 
bandwidth  allocation. 

(15)  Cluster  51  (error,  wireless,  cell,  atm,  video,  networks,  loss,  errors,  mpeg, 
data, network, quality, cells, rate, layer, bit) - focuses on an ATM
(Asynchronous  Transfer  Mode)  multimedia  wireless  network  that  is  efficient  in 
video  transmission  with  an  impressive  BER  (Bit  Error  Rate). 

(15)  Cluster  52  (antenna,  ghz,  radiation,  microstrip,  wireless,  array,  patch, 
two,  frequency,  LAN,  operating,  sons,  impedance,  band,  mhz,  dual,  fed, 
applications) - focuses on microstrip antennas whose stable and broadside
copolarized  radiation  patterns  are  satisfactory  for  Bluetooth  application. 

(16)  Cluster  53  (radio,  system,  mobile,  channel,  cdma,  estimation,  systems, 
joint, algorithm, detection, data, interference, time) - focuses on data
estimation  in  the  uplink  of  a  synchronous  mobile  radio  system  applying  CDMA 
(Code-Division  Multiple  Access). 

(15)  Cluster  54  (rate,  error,  fading,  channel,  detection,  data,  cdma,  system, 
probability, decoding, rayleigh, code, bit, frequency, transmission, scheme) -
focuses on BER (Bit Error Rate) performance of CDMA (Code-Division Multiple
Access) schemes in multipath Rayleigh fading channels.

(15)  Cluster  55  (handoff,  call,  scheme,  calls,  soft,  channel,  method,  new, 
probability,  blocking,  systems,  cdma,  mobile,  multiple)  -  focuses  on  soft 
handoff techniques in DS-CDMA (Direct-Sequence Code-Division Multiple
Access) systems, providing mobile calls with seamless connections between
adjacent cells.

(14)  Cluster  56  (wireless,  link,  internet,  protocol,  systems,  protocols, 
applications, layer, links, tcp, security, ip, data) - focuses on problems with
TCP/IP (Transmission Control Protocol/Internet Protocol) in fixed and mobile
environments.  Some  issues  include:  disadvantaged  links  associated  with 
bandwidth,  high  bit  error  rates,  and  Quality  of  Service. 

(13)  Cluster  57  (power,  control,  cdma,  fading,  channel,  rake,  mobile,  ds, 
multipath,  rate,  transmission,  error,  ber,  frequency,  receiver)  -  focuses  on  the 
performance  of  CDMA  (Code-Division  Multiple  Access)  systems  with  the  use  of 
fast  TPC  (Transmit  Power  Control)  and  Rake  combining,  within  Rayleigh  fading 
channels. 

(13)  Cluster  58  (systems,  spread,  multiple,  fading,  channel,  wireless,  spectrum, 
access, communications, diversity, multipath, cdma) - focuses on achieving
diversity in spread-spectrum communications over fast-fading multipath channels.

(13)  Cluster  59  (power,  protocol,  wireless,  energy,  mobile,  consumption,  low, 
design, system, networks, channel) - focuses on a low-power MAC (Medium
Access Control) protocol for mobile wireless networks. Low power consumption is
vital as more and more mobile units are battery powered.

(13)  Cluster  60  (protocol,  networks,  wireless,  bandwidth,  time,  data,  protocols, 
mobile, transmission, new, signaling) - focuses on improving upon the power
and bandwidth constraints present in typical mobile wireless networks.

APPENDIX 12 - PARTITIONAL CLUSTERING ELEMENTAL
CLUSTERS

(26)  Cluster  0  (multicast  63.9%,  tree  8.3%,  hoc  1.8%,  host  1.5%,  deliveri  1.1%, 
node  1.0%,  mesh  0.9%,  mobil  0.9%,  network  0.8%,  reliabl  0.8%,  flood  0.7%, 
protocol  0.6%,  rout  0.6%,  migrat  0.5%,  rm2  0.5%,  messag  0.5%,  sourc  0.5%, 
join  0.4%,  internet  0.3%,  membership  0.3%,  core  0.3%,  mechan  0.3%,  323 
0.3%,  retransmiss  0.3%,  algorithm  0.2%,  dynam  0.2%,  multipoint  0.2%,  cbt 
0.2%,  address  0.2%,  forward  0.2%,  leav  0.2%,  share  0.2%,  propos  0.2%, 
subcast  0.2%,  packet  0.2%,  overhead  0.1%,  unicast  0.1%,  cluster  0.1%,  effici 
0.1%,  atm  0.1%)  -  focuses  on  the  WMTP  (Wireless  Multicast  Tree  Problem), 
which  occurs  when  two  nodes  within  an  ad-hoc  network  try  to  communicate  with 
one  another. 

(68)  Cluster  1  (voice  36.5%,  data  6.5%,  packet  5.2%,  prma  4.3%,  traffic  2.8%, 
protocol  2.5%,  reserv  2.3%,  integr  1.9%,  termin  1.4%,  access  1.1%,  delai 
0.8%,  servic  0.8%,  system  0.7%,  analysi  0.7%,  scheme  0.7%,  isma  0.6%, 
transmiss  0.6%,  probabl  0.6%,  dynam  0.5%,  propos  0.5%,  channel  0.5%, 
perform  0.5%,  multipl  0.5%,  drop  0.5%,  capac  0.5%,  slot  0.4%,  control  0.4%, 
evalu  0.4%,  load  0.4%,  cdma  0.4%,  code  0.3%,  call  0.3%,  model  0.3%,  analyz 
0.3%,  method  0.3%,  time  0.3%,  aal2  0.3%,  aloha  0.3%,  network  0.3%,  user 
0.3%)  -  focuses  on  integrated  voice  and  data  traffic  in  PRMA  (Packet  Reservation 
Multiple  Access)  schemes. 

(37)  Cluster  2  (call  20.8%,  handoff  17.7%,  probabl  6.1%,  block  3.4%,  drop 
2.9%,  resourc  2.4%,  traffic  2.3%,  admiss  2.2%,  polici  2.1%,  soft  1.8%,  cac 
1.6%,  scheme  1.3%,  method  0.9%,  attempt  0.8%,  cell  0.8%,  cellular  0.7%, 
channel  0.7%,  qo  0.7%,  reserv  0.7%,  system  0.5%,  time  0.5%,  region  0.5%, 
alloc  0.5%,  cdma  0.5%,  arriv  0.5%,  propos  0.4%,  queue  0.4%,  model  0.4%, 
determin 0.4%, qualiti 0.4%, dynam 0.4%, capac 0.3%, hold 0.3%, util 0.3%,
overload 0.3%, failur 0.3%, borrow 0.3%, reduc 0.3%, control 0.3%, network
0.3%) - focuses on different handoff management schemes and their ability to
minimize  dropping  probability  of  handoff  calls  and  blocking  probability  of  new 
calls. 

(57)  Cluster  3  (ofdm  25.3%,  frequenc  6.1%,  symbol  5.1%,  offset  5.1%,  estim 
4.5%,  multiplex  2.6%,  subcarri  2.6%,  orthogon  2.4%,  channel  1.8%,  carrier 
1.3%, code 1.2%, synchron 1.1%, divis 0.9%, phase 0.9%, scheme 0.8%,
subchannel 0.7%, techniqu 0.7%, system 0.7%, cfo 0.7%, propos 0.7%,
bluetooth 0.7%, modul 0.7%, signal 0.6%, error 0.6%, spread 0.6%, receiv
0.6%, select 0.5%, cdma 0.5%, preambl 0.5%, transmitt 0.5%, time 0.5%, rate
0.5%,  multipath  0.5%,  doppler  0.4%,  us  0.4%,  fade  0.4%,  complex  0.4%, 
multicarri  0.4%,  guard  0.4%,  compens  0.4%)  -  focuses  on  the  use  of  OFDM 
(Orthogonal  Frequency-Division  Multiplexing)  to  cut  down  on  ISI  (Inter-Symbol 
Interference)  while  benefiting  from  reduced  path  loss. 

(69)  Cluster  4  (antenna  42.7%,  beam  4.5%,  arrai  3.8%,  radiat  2.8%,  microstrip 
2.0%,  patch  1.6%,  sector  1.3%,  feed  1.2%,  pattern  1.1%,  ghz  0.9%,  dual  0.8%, 
band  0.8%,  circular  0.7%,  indoor  0.6%,  plane  0.6%,  wLAN  0.5%,  field  0.5%, 
frequenc  0.5%,  design  0.5%,  configur  0.5%,  measur  0.5%,  inc  0.4%,  LAN 
0.4%,  monopol  0.4%,  wave  0.4%,  son  0.4%,  john  0.4%,  wilei  0.4%,  imped 
0.4%,  ground  0.4%,  planar  0.4%,  degre  0.3%,  adapt  0.3%,  applic  0.3%,  describ 
0.3%,  oper  0.3%,  beamwidth  0.3%,  port  0.3%,  polar  0.3%,  divers  0.3%)  - 
focuses  on  microstrip  antennas  whose  stable  and  broadside  copolarized  radiation 
patterns  are  satisfactory  for  Bluetooth  application. 

(84)  Cluster  5  (power  25.3%,  control  11.2%,  sir  3.6%,  algorithm  2.0%,  tpc 
2.0%,  link  1.8%,  fade  1.6%,  outag  1.6%,  interfer  1.6%,  cdma  1.5%,  capac 
1.5%,  user  1.3%,  transmit  1.3%,  probabl  1.1%,  station  1.1%,  method  0.9%, 
base  0.8%,  signal  0.8%,  system  0.8%,  radio  0.8%,  mobil  0.8%,  ratio  0.7%, 
revers  0.6%,  cellular  0.6%,  rate  0.6%,  code  0.5%,  receiv  0.5%,  propos  0.5%, 
step  0.5%,  fast  0.5%,  cell  0.4%,  shadow  0.4%,  optim  0.4%,  optimum  0.4%, 
scheme  0.4%,  rayleigh  0.4%,  rake  0.4%,  channel  0.4%,  error  0.4%,  effect 
0.4%)  -  focuses  on  the  use  of  fast  TPC  (Transmit  Power  Control)  to  keep  SIRs 
(Signal-to-Interference plus background noise Ratio) at a satisfactory level.

(111) Cluster 6 (rout 37.4%, hoc 8.6%, node 7.9%, network 3.6%, protocol
2.5%,  mobil  2.0%,  host  1.7%,  locat  1.6%,  cluster  1.4%,  topolog  1.0%, 
algorithm  1.0%,  multihop  0.8%,  scheme  0.6%,  manag  0.6%,  connect  0.6%, 
packet  0.5%,  dynam  0.5%,  destin  0.5%,  chang  0.5%,  path  0.5%,  manet  0.5%, 
inform  0.4%,  zone  0.4%,  agent  0.4%,  distribut  0.3%,  call  0.3%,  pnni  0.3%, 
infrastructur  0.3%,  flood  0.3%,  posit  0.3%,  base  0.3%,  simul  0.3%,  qo  0.3%, 
demand  0.2%,  secur  0.2%,  support  0.2%,  movement  0.2%,  bandwidth  0.2%, 
propos  0.2%,  link  0.2%)  -  focuses  on  the  routing  and  topology  of  a  standard 
wireless  mobile  ad-hoc  network. 

(58)  Cluster  7  (handoff  19.1%,  rerout  9.5%,  handov  9.3%,  cell  8.0%,  path  4.2%, 
atm  3.5%,  switch  3.2%,  connect  2.2%,  mobil  1.8%,  network  1.7%,  buffer 
0.9%,  scheme  0.9%,  architectur  0.8%,  station  0.8%,  optim  0.6%,  manag  0.6%, 
pressur 0.5%, pvc 0.5%, propos 0.5%, transfer 0.5%, virtual 0.5%, servic 0.5%,
inter 0.4%, loss 0.4%, termin 0.4%, radio 0.4%, base 0.4%, ca2 0.4%, effici
0.3%,  delai  0.3%,  memori  0.3%,  melt  0.3%,  call  0.3%,  support  0.3%, 
micromobl  0.3%,  reestablish  0.2%,  soft  0.2%,  function  0.2%,  handl  0.2%, 
protein  0.2%)  -  focuses  on  cell  numbering  to  allow  seamless  handoffs  between 
mobile  stations  and  ATM  (Asynchronous  Transfer  Mode)  networks. 

(107)  Cluster  8  (divers  11.5%,  rake  6.5%,  fade  4.0%,  multipath  4.0%,  combin 
3.8%,  antenna  3.8%,  receiv  3.6%,  channel  2.6%,  path  2.0%,  coher  1.5%,  branch 
1.3%,  interfer  1.2%,  averag  1.2%,  pilot  1.2%,  select  1.1%,  signal  1.1%,  cdma 
1.0%,  ber  1.0%,  recept  1.0%,  sector  1.0%,  power  0.9%,  spread  0.8%,  ratio 
0.7%,  effect  0.7%,  indoor  0.6%,  code  0.6%,  estim  0.6%,  error  0.6%,  equal 
0.6%,  perform  0.6%,  scheme  0.6%,  sequenc  0.5%,  system  0.5%,  symbol  0.5%, 
direct  0.5%,  gain  0.5%,  us  0.5%,  transmit  0.5%,  multipl  0.5%,  correl  0.4%)  - 
focuses on increasing throughput by the use of coherent Rake combiners within
Rayleigh  multipath  fading  channels. 

(121)  Cluster  9  (atm  30.2%,  network  4.5%,  servic  3.3%,  architectur  2.7%,  qos 
2.6%,  manag  2.4%,  mobil  2.2%,  support  1.7%,  broadband  1.3%,  wireless  1.3%, 
signal  1.2%,  multimedia  1.2%,  concept  0.9%,  protocol  0.8%,  connect  0.8%, 
switch  0.8%,  design  0.6%,  capabl  0.6%,  termin  0.6%,  awa  0.6%,  articl  0.6%, 
isdn  0.5%,  user  0.5%,  issu  0.5%,  transport  0.5%,  function  0.5%,  provid  0.5%, 
technolog  0.5%,  control  0.5%,  access  0.4%,  locat  0.4%,  integr  0.4%,  resourc 
0.4%,  describ  0.4%,  infrastructur  0.4%,  implement  0.4%,  discuss  0.4%,  qualiti 
0.4%,  system  0.4%,  call  0.4%)  -  focuses  on  maintaining  guaranteed  levels  of  QoS 
(Quality  of  Service)  within  wireless  ATM  (Asynchronous  Transfer  Mode) 
networks. 

(96)  Cluster  10  (tcp  10.1%,  arq  6.6%,  error  5.8%,  cell  4.3%,  atm  3.0%,  packet 
2.6%, scheme 2.5%, link 2.5%, video 2.2%, fec 2.2%, control 2.1%, loss 2.0%,
layer  2.0%,  retransmiss  1.3%,  throughput  1.3%,  protocol  1.3%,  congest  1.2%, 
rate  1.1%,  transmiss  1.0%,  repeat  0.9%,  data  0.8%,  mpeg  0.8%,  network  0.7%, 
header  0.7%,  propos  0.7%,  correct  0.7%,  channel  0.6%,  bit  0.6%,  stream  0.6%, 
wireless  0.6%,  mechan  0.6%,  acknowledg  0.6%,  transport  0.6%,  protect  0.5%, 
request  0.5%,  transfer  0.5%,  code  0.5%,  perform  0.5%,  adapt  0.4%,  recoveri 
0.4%) - focuses on the performance of TCP (Transmission Control Protocol)
employing ARQ/FEC (Automatic Repeat Request/Forward Error Correction)
protocols to cut down on packet errors.

(119)  Cluster  11  (traffic  9.2%,  protocol  4.8%,  mac  4.4%,  alloc  3.5%,  schedul 
2.8%, servic 1.9%, resourc 1.8%, tdma 1.6%, prioriti 1.5%, real 1.5%, access
1.5%, dynam 1.5%, time 1.5%, atm 1.4%, qos 1.4%, class 1.4%, propos 1.4%,
slot  1.3%,  multimedia  1.3%,  packet  1.3%,  algorithm  1.3%,  assign  1.2%,  scheme 
1.2%,  control  1.2%,  delai  1.0%,  bandwidth  1.0%,  channel  0.9%,  base  0.8%, 
throughput  0.8%,  qualiti  0.8%,  reserv  0.7%,  effici  0.7%,  request  0.7%,  fair 
0.6%,  medium  0.6%,  load  0.6%,  network  0.6%,  wireless  0.6%,  rate  0.6%, 
transmiss 0.6%) - focuses on MAC (Medium Access Control) protocols that
efficiently accommodate real-time voice and video traffic by employing suboptimal
resource allocation.

(80)  Cluster  12  (802  12.4%,  protocol  9.8%,  ieee  9.4%,  mac  4.7%,  ethernet 
2.5%,  real  1.8%,  time  1.7%,  standard  1.4%,  station  1.1%,  network  1.1%,  wLAN 
1.0%,  dcf  1.0%,  data  1.0%,  LAN  0.9%,  throughput  0.9%,  energi  0.9%,  packet 
0.8%,  save  0.8%,  layer  0.8%,  backoff  0.7%,  model  0.7%,  hidden  0.7%,  frame 
0.7%,  control  0.7%,  medium  0.6%,  traffic  0.6%,  delai  0.6%,  access  0.5%, 
specif  0.5%,  mechan  0.5%,  coordin  0.5%,  termin  0.5%,  csma  0.5%,  power 
0.5%,  tune  0.5%,  transmiss  0.5%,  channel  0.4%,  commun  0.4%,  wireless  0.4%, 
content  0.4%)  -  focuses  on  standard  MAC  (Medium  Access  Control)  protocols  for 
IEEE 802.11 wireless LANs.

(147)  Cluster  13  (csma  14.8%,  protocol  8.6%,  collis  5.1%,  throughput  3.8%, 
packet  2.8%,  delai  2.3%,  LAN  1.8%,  model  1.8%,  aloha  1.7%,  channel  1.7%, 
sens  1.6%,  network  1.5%,  token  1.4%,  persist  1.3%,  load  1.3%,  messag  1.3%, 
bu  1.3%,  access  1.3%,  station  1.1%,  analysi  0.9%,  perform  0.9%,  time  0.8%, 
ring  0.8%,  slot  0.8%,  detect  0.6%,  retransmiss  0.6%,  analyz  0.6%,  transmiss 
0.5%,  traffic  0.5%,  resolut  0.5%,  result  0.5%,  markov  0.5%,  carrier  0.5%, 
queue  0.5%,  local  0.5%,  stabil  0.4%,  probabl  0.4%,  obtain  0.4%,  base  0.4%, 
random 0.4%) - focuses on CSMA (Carrier-Sense Multiple Access) protocols that
increase throughput by focusing on packet collision avoidance.

(135)  Cluster  14  (estim  7.1%,  channel  4.4%,  interfer  3.6%,  receiv  3.5%, 
detector  3.3%,  cancel  2.5%,  adapt  2.4%,  multius  2.3%,  equal  2.2%,  algorithm 
2.2%,  multipath  1.9%,  signal  1.7%,  fade  1.6%,  blind  1.4%,  cdma  1.4%,  filter 
1.3%,  dfe  1.1%,  user  1.0%,  time  0.9%,  decis  0.9%,  complex  0.9%,  train  0.8%, 
arrai  0.8%,  process  0.8%,  suppress  0.8%,  propos  0.8%,  sequenc  0.8%,  perform 
0.7%,  antenna  0.7%,  symbol  0.7%,  error  0.7%,  code  0.7%,  space  0.7%,  multipl 
0.6%,  squar  0.6%,  decorrel  0.5%,  system  0.5%,  detect  0.5%,  method  0.5%, 
approach  0.5%)  -  focuses  on  multiuser  detectors  for  cochannel  interference 
cancellation,  developed  by  computing  fixed-lag  MAP  (Maximum  A  Posteriori) 
estimates of the data symbols of the desired user.

(155) Cluster 15 (amplifi 5.3%, ghz 5.0%, power 3.7%, filter 3.2%, modul
2.8%,  chip  2.8%,  frequenc  2.6%,  circuit  2.5%,  voltag  2.2%,  digit  1.7%,  mhz 
1.6%,  signal  1.6%,  band  1.6%,  suppli  1.5%,  nois  1.4%,  low  1.3%,  integr  1.3%, 
design  1.3%,  oscil  1.2%,  phase  1.2%,  dbm  1.2%,  cmos  1.2%,  spectrum  1.1%, 
analog  1.0%,  output  0.9%,  modem  0.8%,  rang  0.8%,  technolog  0.7%,  transceiv 
0.7%,  receiv  0.7%,  spread  0.7%,  fabric  0.6%,  devic  0.6%,  applic  0.5%, 
implement  0.5%,  compon  0.5%,  oper  0.5%,  vco  0.5%,  nonlinear  0.4%,  local 
0.4%)  -  focuses  on  different  aspects  of  the  CMOS  hardware  or  circuitry 
incorporated  in  wireless  technology. 

(145)  Cluster  16  (code  9.3%,  error  4.7%,  rate  4.0%,  bit  2.9%,  fade  2.9%, 
spread  2.7%,  channel  2.6%,  turbo  2.3%,  decod  2.1%,  scheme  2.1%,  sequenc 
2.0%,  cdma  1.8%,  modul  1.5%,  orthogon  1.4%,  hop  1.3%,  system  1.2%, 
transmiss  1.1%,  frequenc  1.0%,  rayleigh  1.0%,  multipath  0.9%,  perform  0.8%, 
multipl  0.8%,  ber  0.8%,  select  0.6%,  divis  0.6%,  divers  0.6%,  receiv  0.6%, 
techniqu  0.6%,  improv  0.6%,  time  0.5%,  shift  0.5%,  transmit  0.5%,  result 
0.5%,  direct  0.5%,  propos  0.5%,  spectrum  0.5%,  detect  0.5%,  symbol  0.4%, 
data 0.4%, combin 0.4%) - focuses on the use of DS-CDMA (Direct-Sequence
Code-Division Multiple Access) systems that improve BER (Bit Error
Rate) within Rayleigh multipath fading channels.

(95)  Cluster  17  (bluetooth  13.4%,  standard  5.3%,  devic  4.3%,  802  3.1%,  ieee 
3.0%,  wLAN  2.7%,  technolog  2.4%,  design  1.8%,  product  1.5%,  test  1.3%, 
LAN  1.3%,  articl  1.2%,  wireless  1.2%,  develop  1.2%,  cost  1.2%,  network 
1.1%,  market  1.0%,  home  0.9%,  band  0.8%,  local  0.8%,  low  0.8%,  electron 
0.8%,  discuss  0.8%,  data  0.8%,  radio  0.8%,  layer  0.8%,  power  0.7%,  ghz  0.7%, 
connect 0.7%, person 0.7%, ethernet 0.7%, commun 0.7%, describ 0.6%, equip
0.6%,  speed  0.6%,  system  0.6%,  interoper  0.6%,  sensor  0.5%,  hiperLAN  0.5%, 
interfac 0.5%) - focuses on standard Bluetooth and IEEE 802.11 wireless LAN
technologies.

(178)  Cluster  18  (cdma  11.7%,  capac  6.2%,  code  3.1%,  system  3.0%,  interfer 
2.9%,  cell  2.6%,  divis  2.5%,  satellit  2.4%,  user  1.9%,  multipl  1.6%,  link  1.4%, 
acquisit  1.3%,  cellular  1.1%,  mobil  1.1%,  access  1.0%,  signal  1.0%,  techniqu 
0.9%,  channel  0.9%,  spread  0.9%,  downlink  0.9%,  spectrum  0.8%,  frequenc 
0.8%,  radio  0.8%,  power  0.8%,  uplink  0.8%,  tdd  0.7%,  station  0.7%,  detect 
0.6%,  antenna  0.6%,  shown  0.6%,  direct  0.6%,  time  0.6%,  synchron  0.5%, 
search  0.5%,  sequenc  0.5%,  commun  0.5%,  receiv  0.5%,  tdma  0.5%,  consid 
0.5%, spectral 0.5%) - focuses on the use of CDMA (Code-Division Multiple Access)
systems in satellite communications.

(74) Cluster 19 (optic 6.8%, indoor 4.6%, wave 3.5%, reflect 3.1%, build 2.3%,
column  2.2%,  wall  2.1%,  method  1.4%,  propag  1.4%,  measur  1.4%,  laser 
1.3%, calcul 1.3%, characterist 1.2%, pressur 1.1%, temperatur 1.0%, multipath
0.9%,  floor  0.9%,  oxygen  0.9%,  quartz  0.9%,  phase  0.8%,  transmitt  0.8%,  trace 
0.8%,  predict  0.7%,  fiber  0.7%,  rai  0.7%,  offic  0.7%,  environ  0.7%,  signal 
0.7%,  degre  0.6%,  millimet  0.6%,  system  0.6%,  us  0.6%,  materi  0.5%, 
microwav  0.5%,  sfc  0.5%,  lo  0.5%,  model  0.5%,  delai  0.5%,  rang  0.5%, 
increas  0.5%)  -  focuses  on  indoor  infrared  wireless  systems  in  the  field  of  fiber 
optical  networks. 

(233)  Cluster  20  (servic  6.1%,  internet  5.8%,  mobil  3.8%,  network  3.2%, 
technolog  3.2%,  protocol  1.9%,  develop  1.6%,  wireless  1.5%,  system  1.4%, 
commun  1.4%,  support  1.4%,  applic  1.3%,  provid  1.3%,  secur  1.2%,  layer 
1.0%,  futur  1.0%,  gener  1.0%,  broadband  0.9%,  articl  0.8%,  issu  0.8%,  access 
0.8%,  multimedia  0.8%,  architectur  0.8%,  infrastructur  0.8%,  tcp  0.8%,  user 
0.8%,  video  0.7%,  describ  0.7%,  discuss  0.7%,  data  0.7%,  telecommun  0.7%, 
project  0.6%,  person  0.6%,  platform  0.6%,  research  0.6%,  standard  0.5%, 
current  0.5%,  authent  0.5%,  global  0.5%,  comput  0.5%)  -  focuses  on  a  number 
of  different  services  and  protocols  becoming  available  in  current  mobile 
telecommunication networks.

APPENDIX 13 - SEMINAL PAPERS OF WIRELESS LANs LITERATURE


ABSTRACT 

A  chronically  weak  area  in  research  papers,  reports,  and  reviews  is  the  complete 
identification  of  seminal  background  documents  that  formed  the  building  blocks 
for  these  papers.  A  method  for  systematically  determining  these  seminal 
references is presented. Citation-Assisted Background (CAB) is based on the
assumption  that  seminal  documents  tend  to  be  highly  cited.  Application  of  CAB  to 
the  field  of  Wireless  LANs  research  is  presented.  While  CAB  is  a  highly 
systematic  approach  for  identifying  seminal  references,  it  serves  as  a  supplement 
and is not a substitute for the judgment of the authors.

INTRODUCTION


Research  is  a  method  of  systematically  exploring  the  unknown  to  acquire 
knowledge  and  understanding.  Efficient  research  requires  awareness  of  all  prior 
research  and  technology  that  could  impact  the  research  topic  of  interest,  and  builds 
upon  these  past  advances  to  create  discovery  and  new  advances.  The  importance 
of  this  awareness  of  prior  art  is  recognized  throughout  the  research  community.  It 
is  expressed  in  diverse  ways,  including  requirements  for  Background  sections  in 
journal  research  articles,  invited  literature  surveys  in  targeted  research  areas,  and 
required  descriptions  of  prior  art  in  patent  applications. 

For  the  most  part,  development  of  Background  material  for  any  of  the  above 
applications  is  relatively  slow  and  labor  intensive,  and  limited  in  scope. 
Background  material  development  usually  involves  some  combination  of  manually 
sifting  through  outputs  of  massive  computer  searches,  manually  tracking 
references  through  multiple  generations,  and  searching  one’s  own  records  for 
personal  references.  The  few  studies  that  have  been  done  on  the  adequacy  of 
Background  material  in  documents  show  that  only  a  modest  fraction  of  relevant 
material  is  included  (e.g.,  MacRoberts  and  MacRoberts,  1989,  1996;  Liu,  1993). 

Typically  missing  from  standard  Background  section  or  review  article 
development,  as  well  as  in  the  specific  examples  cited  above,  is  a  systematic 
approach  for  identifying  the  key  documents  and  events  that  provided  the 
groundwork  for  the  research  topic  of  interest.  The  present  appendix  presents  such 
a  systematic  approach  for  identifying  the  key  documents,  called  Citation-Assisted 
Background  (CAB),  and  applies  it  to  the  area  of  Wireless  LANs  research. 

Published research reviews on Wireless LANs typically cover a limited
subset of the technology rather than the total discipline. None of these published
reviews has the spatial and temporal breadth of coverage of the present report,
none uses a query of the extent and complexity of the present report, and none uses
the systematic approach described here to ensure that all highly cited articles
related to the discipline of interest are identified. In the main body of the present
report, we used text mining techniques to query the open literature to uncover the
infrastructure of Wireless LANs publications. In the present appendix, we describe
a systematic approach to ensure that all highly cited seminal articles related to
Wireless LANs are identified, and we apply it to the Wireless LANs literature to
demonstrate the utility of the CAB approach.

CONCEPT  DESCRIPTION


The  CAB  concept  [Kostoff  and  Shlesinger,  2005d]  identifies  the  seminal 
Background  documents  for  a  research  area  using  citation  analysis.  CAB  rests  on 
the  assumption  that  a  seminal  document  for  a  specific  research  area  will  typically 
have  been  referenced  positively  by  a  substantial  number  of  people  who  are  active 
researchers  in  that  specific  area.  Implementation  of  the  CAB  concept  then  requires 
the  following  steps: 

•  The  research  area  of  interest  must  be  defined  clearly 

•  The  documents  that  define  the  area  of  interest  must  be  identified  and  retrieved 

•  The  references  most  frequently  used  in  these  documents  must  be  identified  and 
selected 

•  These  critical  references  must  be  analyzed,  and  integrated  in  a  cohesive 
narrative  manner  to  form  a  comprehensive  Background  section  or  separate 
literature  survey 

These  required  steps  are  achieved  in  the  following  manner. 

•  The  research  topic  of  interest  is  defined  clearly  by  the  researchers  who  are 
documenting  their  study  results.  In  the  present  study,  this  definition  was 
presented  in  the  Introduction. 

•  The  topical  definition  is  sharpened  further  by  the  development  of  a  literature 
retrieval  query.  In  the  present  study,  the  detailed  query  was  presented  in  the 
Introduction. 

•  The  query  is  entered  into  a  database  search  engine,  and  documents  relevant 
to  the  topic  are  retrieved.  In  the  present  appendix,  6230  research  articles 
were  retrieved  from  the  Web  version  of  the  Science  Citation  Index  (SCI)  for 
the  years  1991  to  mid-2005.  The  SCI  was  used  because  it  is  the  only  major
research  database  that  contains  references  in  a  readily  extractable  format.

•  These  documents  are  combined  to  create  a  separate  database,  and  all  the 
references  contained  in  these  documents  are  extracted.  Identical  references 
are  combined,  the  number  of  occurrences  of  each  reference  is  tabulated,  and 
a  table  of  references  and  their  occurrence  frequencies  is  constructed.  In  the 
present  text  mining  study,  separate  references  with  frequencies  of  two  or 
greater were extracted and tabulated. Table A13-1 contains the ten highest
frequency (most cited) references extracted from the Wireless LANs
database.

TABLE  A13-1  -  TEN  MOST  HIGHLY  CITED  DOCUMENTS  -  1991-2005

AUTHOR            YEAR  SOURCE                  VOLUME  PAGE   #CIT
GILHOUSEN KS      1991  IEEE T VEH TECHNOL      V40     P303   212
PROAKIS JG        1995  DIGITAL COMMUNICATIO                   154
JAKES WC          1974  MICROWAVE MOBILE COM                   134
JOHNSON DB        1996  MOBILE COMPUTING                P153   133
PERKINS CE        1999  P 2 IEEE WORKSH MOB             P90    121
VITERBI AJ        1995  CDMA PRINCIPLES SPRE                   114
RAPPAPORT TS      1996  WIRELESS COMMUNICATI                   110
BROCH J           1998  P 4 ANN ACM IEEE INT            P85    108
RAYCHAUDHURI D    1994  IEEE J SEL AREA COMM    V12     P1401  104
GOODMAN DJ        1989  IEEE T COMMUN           V37     P885   98

Two  frequencies  are  computed  for  each  reference,  but  only  the  first  is  shown  in 
Table  A13-1.  The  frequency  shown  in  the  rightmost  column  is  the  number  of
times  each  reference  was  cited  by  the  6230  records  in  the  retrieved  database  only. 
This  number  reflects  the  importance  of  a  given  reference  to  the  specific  discipline 
of  Wireless  LANs.  The  second  frequency  number  (not  shown)  is  the  total  number 
of  citations  the  reference  received  from  all  sources  in  all  years  after  publication, 
and  reflects  the  importance  of  a  given  reference  to  all  the  fields  of  science  that 
cited  the  reference.  This  second  number  is  obtained  from  the  citation  field  or 
citation  window  in  the  SCI.  In  CAB,  only  the  first  frequency  is  used,  since  it  is 
topic-specific.  Using  the  first  discipline-specific  frequency  number  obviates  the 
need  to  normalize  citation  frequencies  for  different  disciplines  (due  to  different 
levels  of  activity  in  different  disciplines),  as  would  be  the  case  if  total  citation 
frequencies  were  used  to  determine  the  ordering  of  the  references. 
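The tabulation step described above (extract every reference from the retrieved records, combine identical references, count occurrences, keep those cited two or more times) can be sketched in a few lines of Python. This is only an illustrative sketch: the input format (one list of reference strings per record), the whitespace and case normalization, and the function name `tabulate_references` are assumptions, not the procedure actually used in the study.

```python
from collections import Counter

def tabulate_references(records, min_count=2):
    """Count how many retrieved records cite each reference.

    `records` is a list of records, each given as a list of reference
    strings (a hypothetical input format). A reference is counted at
    most once per citing record.
    """
    counts = Counter()
    for refs in records:
        # Normalize case and whitespace so identical references combine
        unique_refs = {" ".join(r.upper().split()) for r in refs}
        counts.update(unique_refs)
    # Keep references at or above the threshold, most cited first
    table = [(ref, n) for ref, n in counts.items() if n >= min_count]
    table.sort(key=lambda item: (-item[1], item[0]))
    return table

# Three toy records; the reference strings are abbreviated SCI-style entries
records = [
    ["GILHOUSEN KS, 1991, IEEE T VEH TECHNOL, V40, P303",
     "PROAKIS JG, 1995, DIGITAL COMMUNICATIO"],
    ["gilhousen ks, 1991, ieee t veh technol, v40, p303"],
    ["PROAKIS JG, 1995, DIGITAL COMMUNICATIO",
     "GILHOUSEN KS,  1991, IEEE T VEH TECHNOL, V40, P303",
     "JAKES WC, 1974, MICROWAVE MOBILE COM"],
]
table = tabulate_references(records)
for ref, n in table:
    print(n, ref)
```

Because only citations from the retrieved records are counted, the resulting frequency is the topic-specific one used by CAB, not the total citation count reported by the SCI.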

CONCEPT  IMPLEMENTATION 

To identify the total candidate references for the Background section, a table
similar in structure to Table A13-1, but containing all the references from the
retrieved records, is constructed. A threshold frequency for selection can be
determined by arbitrary inspection (e.g., a Background section consisting of 150
key references is arbitrarily selected). The first author has found a dynamic
selection process more useful. In this dynamic process, references are selected,
analyzed, and grouped based on their order in the citation frequency table until the
resulting Background is judged sufficiently complete by the Background
developers.

To ensure that the influential documents published both long ago and very recently
are included, the following total process is used. The reference frequency table is
ordered by descending citation frequency, as above, and a high value of the selection
frequency threshold is selected initially. Documents with citation frequencies above
this threshold are tagged. Then, the table is re-ordered chronologically. The early
historical documents with citation frequencies substantially larger than those of their
contemporaries are selected, as are the extremely recent documents with citation
frequencies substantially larger than those of their contemporaries. Contemporaries
are documents published in the same time frame, not limited to the same year (see
the next paragraph for examples of how we implement 'same time frame'). Then, the
dynamic selection process defined above is applied to the early historical references,
the intermediate time references (those falling under the high frequency threshold),
and the extremely recent references (approximately two years or less).

Table  A13-2  contains  the  final  references  selected  for  the  Wireless  LANs 
Background  survey.  The  first  reference  listed,  Shannon’s  paper(s)  on  the 
mathematical  theory  of  communication,  had  many  more  citations  than  any  paper 
published  in  the  1900s,  up  to  Price’s  paper  in  1958.  Specifically,  from  the  early 
1900s  to  1957,  the  next  most  cited  paper  had  five  citations.  In  turn,  Price’s  paper 
had  more  citations  than  any  published  previously,  or  those  published  until  Gilbert’s 
paper  in  1960.  Specifically,  from  1958  to  1960,  the  next  most  cited  paper  had  nine 
citations.  This  is  a  graphic  example  of  how  we  interpret  a  paper’s  having 
substantially  more  citations  than  its  contemporaries.  We  do  not  constrain  ourselves 
with  a  numerical  threshold,  but  rather  interpret  the  total  citation  pattern  within  a 
given  time  frame. 
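The Shannon, Price, and Gilbert examples above follow a pattern that can be approximated in code: scanning the table chronologically, each of these papers has more citations than anything published before it. The sketch below implements that running-maximum reading as a rough screening aid. It is an assumption made for illustration only; the report states that no numerical threshold is used and that the total citation pattern is interpreted by the analysts, so the function `chronological_standouts` should not be read as the authors' selection rule. The 1959 entry is a hypothetical placeholder standing in for the nine-citation paper mentioned in the text.

```python
def chronological_standouts(refs):
    """Return references that out-cite every earlier-published reference.

    `refs` is a list of (author, year, cites) tuples. This running-maximum
    rule is a simplified heuristic, not the judgment-based selection
    described in the text.
    """
    standouts = []
    best_so_far = 0
    # Oldest first; within a year, most cited first
    for author, year, cites in sorted(refs, key=lambda r: (r[1], -r[2])):
        if cites > best_so_far:
            standouts.append((author, year, cites))
            best_so_far = cites
    return standouts

refs = [
    ("SHANNON CE", 1948, 10),
    ("SHANNON CE", 1948, 5),
    ("SHANNON CE", 1949, 5),
    ("PRICE R", 1958, 24),
    ("HYPOTHETICAL A", 1959, 9),  # placeholder row, not taken from the table
    ("GILBERT EN", 1960, 38),
]
standouts = chronological_standouts(refs)
for row in standouts:
    print(row)
```

A rule of this kind would miss later documents such as Little (1961), which the analysts did select, so at most it supplies candidates for the judgment step.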

TABLE  A13-2  -  SEMINAL  DOCUMENTS  SELECTED  FOR  INCLUSION  IN  BACKGROUND

AUTHOR            YEAR  SOURCE                  VOLUME  PAGE   #CITES
SHANNON CE        1948  BELL SYST TECH J        V27     P379   10
SHANNON CE        1948  BELL SYST TECH J        V27     P623   5
SHANNON CE        1949  PIRE                    V37     P10    5
PRICE R           1958  PIRE                    V46     P555   24
GILBERT EN        1960  BELL SYST TECH J        V39     P1253  38
LITTLE JDC        1961  OPER RES                V9      P383   10
BELLO PA          1963  IRE T COMMUN SYST       V11     P360   37
SCHWARTZ M        1966  COMMUNICATION SYSTEM                   23
CHANG RW          1966  BELL SYST TECH J        V45     P1775  16
CLARKE RH         1968  BELL SYST TECH J        V47     P957   12
BRADY PT          1968  BELL SYST TECH J        V47     P73    12
BRADY PT          1969  BELL SYST TECH J        V48     P2445  27
ABRAMSON N        1970  P FALL JOINT COMP C     V37     P281   23
WEINSTEIN SB      1971  IEEE T COMMUN TECHNO    V19     P628   38
JAKES WC          1974  MICROWAVE MOBILE COM                   134
TOBAGI FA         1975  IEEE T COMMUN           V23     P1417  84
KLEINROCK L       1975  IEEE T COMMUN           V23     P1400  75
KLEINROCK L       1975  QUEUEING SYSTEMS        V1             55
KLEINROCK L       1975  IEEE T COMMUN           V23     P410   19
METCALFE RM       1976  COMMUN ACM              V19     P395   28
DIFFIE W          1976  IEEE T INFORM THEORY    V22     P644   20
PURSLEY MB        1977  IEEE T COMMUN           V25     P795   72
COOPER GR         1978  IEEE T VEH TECHNOL      V27     P264   19
GFELLER FR        1979  P IEEE                  V67     P1474  36
CAPETANAKIS JI    1979  IEEE T INFORM THEORY    V25     P505   24
TOBAGI FA         1980  COMPUT NETWORKS         V4      P245   34
TURIN GL          1980  P IEEE                  V68     P328   30
BAKER DJ          1981  IEEE T COMMUN           V29     P1694  20
RAYCHAUDHURI D    1981  IEEE T COMMUN           V29     P895   20
PICKHOLTZ RL      1982  IEEE T COMMUN           V30     P855   23
LIN S             1983  ERROR CONTROL CODING                   52
TAKAGI H          1984  IEEE T COMMUN           V32     P246   25
CIMINI LJ         1985  IEEE T COMMUN           V33     P665   55
VERDU S           1986  IEEE T INFORM THEORY    V32     P85    48
SALEH AAM         1987  IEEE J SEL AREA COMM    V5      P128   45
EPHREMIDES A      1987  P IEEE                  V75     P56    38
BERTSEKAS D       1987  DATA NETWORKS                          36
JUBIN J           1987  P IEEE                  V75     P21    33
HAGENAUER J       1988  IEEE T COMMUN           V36     P389   25
GOODMAN DJ        1989  IEEE T COMMUN           V37     P885   98
PROAKIS JG        1989  DIGITAL COMMUNICATIO                   96
LUPAS R           1989  IEEE T INFORM THEORY    V35     P123   37
KARN P            1990  P ARRL CRRL AM RAD 9 C          P134   52
BINGHAM JAC       1990  IEEE COMMUN MAG         V28     P5     40
VARANASI MK       1990  IEEE T COMMUN           V38     P509   36
LUPAS R           1990  IEEE T COMMUN           V38     P496   35
GILHOUSEN KS      1991  IEEE T VEH TECHNOL      V40     P303   212
LEE WCY           1991  IEEE T VEH TECHNOL      V40     P291   66
NANDA S           1991  IEEE T VEH TECHNOL      V40     P584   47
PICKHOLTZ RL      1991  IEEE T VEH TECHNOL      V40     P313   40
GOODMAN DJ        1991  IEEE T VEH TECHNOL      V40     P170   39
BERTSEKAS D       1992  DATA NETWORKS                          79
ZANDER J          1992  IEEE T VEH TECHNOL      V41     P57    32
MOULY M           1992  GSM SYSTEM MOBILE CO                   32
VITERBI AM        1993  IEEE J SEL AREA COMM    V11     P892   54
HASHEMI H         1993  P IEEE                  V81     P943   39
YEE N             1993  P IEEE PIMRC 93 YOK             P109   32
ARIYAVISITAKUL S  1993  IEEE T COMMUN           V41     P1626  31
RAYCHAUDHURI D    1994  IEEE J SEL AREA COMM    V12     P1401  104
PERKINS CE        1994  P ACM SIGCOMM 94 LON            P234   86
STEVENS WR        1994  TCP IP ILLUSTRATED      V1             59
BHARGHAVAN V      1994  P ACM SIGCOMM 94                P212   57
ACAMPORA AS       1994  IEEE J SEL AREA COMM    V12     P1365  56
BAIER A           1994  IEEE J SEL AREA COMM    V12     P733   56
WINTERS JH        1994  IEEE T COMMUN           V42     P1740  34
MADHOW U          1994  IEEE T COMMUN           V42     P3178  34
MOOSE PH          1994  IEEE T COMMUN           V42     P2908  33
JAKES WC          1994  MICROWAVE MOBILE COM                   31
ARIYAVISITAKUL S  1994  IEEE T COMMUN           V42     P597   30
PROAKIS JG        1995  DIGITAL COMMUNICATIO                   154
VITERBI AJ        1995  CDMA PRINCIPLES SPRE                   114
CACERES R         1995  IEEE J SEL AREA COMM    V13     P850   43
KOHNO R           1995  IEEE COMMUN MAG         V33     P58    42
PAHLAVAN K        1995  WIRELESS INFORMATION                   40
GERLA M           1995  ACM BALTZER J WIRELE    V1      P255   39
BAKRE A           1995  P 15 INT C DISTR COM            P136   35
HONIG M           1995  IEEE T INFORM THEORY    V41     P944   34
YATES RD          1995  IEEE J SEL AREA COMM    V13     P1341  33
GERLA M           1995  WIREL NETW              V1      P255   32
JOHNSON DB        1996  MOBILE COMPUTING                P153   133
RAPPAPORT TS      1996  WIRELESS COMMUNICATI                   110
FOSCHINI GJ       1996  BELL LABS TECH J        V1      P41    44
MURTHY S          1996  ACM BALTZER MOBILE N    V1      P183   38
ADACHI F          1996  IEICE T COMMUN EB       V79     P1316  36
PERKINS C         1996  2002 RFC                               31
JOHNSON DB        1996  MOBILE COMPUTING                       30
*IEEE             1997  80211 IEEE                             91
LIN CHR           1997  IEEE J SEL AREA COMM    V15     P1265  62
BALAKRISHNAN H    1997  IEEE ACM T NETWORK      V5      P756   59
PARK VD           1997  P IEEE INFOCOM 97 KO            P1405  51
RAYCHAUDHURI D    1997  IEEE J SEL AREA COMM    V15     P83    47
HARA S            1997  IEEE COMMUN MAG         V35     P126   43
TOH CK            1997  WIRELESS PERS COMMUN    V4      P103   39
CROW BP           1997  IEEE COMMUN MAG         V35     P116   36
KAHN JM           1997  P IEEE                  V85     P265   30
CHIANG CC         1997  P IEEE SICON 97 APR             P197   30
*IEEE             1997  802111997 IEEE                         30
BROCH J           1998  P 4 ANN ACM IEEE INT            P85    108
TAROKH V          1998  IEEE T INFORM THEORY    V44     P744   75
ALAMOUTI SM       1998  IEEE J SEL AREA COMM    V16     P1451  64
VERDU S           1998  MULTIUSER DETECTION                    62
ADACHI F          1998  IEEE COMMUN MAG         V36     P56    58
FOSCHINI GJ       1998  WIRELESS PERS COMMUN    V6      P311   56
KO YB             1998  P ACM IEEE MOBICOM 9            P66    48
DAHLMAN E         1998  IEEE COMMUN MAG         V36     P70    41
LI Y              1998  IEEE T COMMUN           V46     P902   39
SINGH S           1998  P 4 ANN ACM IEEE INT            P181   33
SEO S             1998  IEICE T COMMUN EB       V81     P1508  33
PERKINS CE        1999  P 2 IEEE WORKSH MOB             P90    121
ROYER EM          1999  IEEE PERS COMMUN        V6      P46    68
VANNEE R          1999  IEEE COMMUN MAG         V37     P82    42
RODOPLU V         1999  IEEE J SEL AREA COMM    V17     P1333  40
ROYER EM          1999  IEEE PERSONAL CO APR            P46    39
SOBRINHO JL       1999  IEEE J SEL AREA COMM    V17     P1353  38
TAROKH V          1999  IEEE T INFORM THEORY    V45     P1456  38
NI SY             1999  P 5 ANN ACM IEEE INT            P151   36
*IEEE             1999  80211A IEEE                            36
IWATA A           1999  IEEE J SEL AREA COMM    V17     P1369  35
LIN CHR           1999  IEEE J SEL AREA COMM    V17     P1426  35
LI Y              1999  IEEE J SEL AREA COMM    V17     P461   35
ZHOU LD           1999  IEEE NETWORK            V13     P24    33
GUPTA P           2000  IEEE T INFORM THEORY    V46     P388   81
BIANCHI G         2000  IEEE J SEL AREA COMM    V18     P535   77
CALI F            2000  IEEE ACM T NETWORK      V8      P785   49
VANNEE R          2000  OFDM WIRELESS MULTIM                   45
HAARTSEN JC       2000  IEEE PERS COMMUN        V7      P28    37
PERKINS CE        2001  AD HOC NETWORKING                      36
XU SG             2001  IEEE COMMUN MAG         V39     P130   30
AKYILDIZ IF       2002  COMPUT NETW             V38     P393   23
JOHNSON DB        2003  DYNAMIC SOURCE ROUTI                   13
XUE F             2004  WIREL NETW              V10     P169   6
JOHNSON D         2004  3775 RFC                               6


There  were  a  total  of  133  documents  selected  as  seminal.  While  the  earliest 
seminal  documents  in  the  physical  sciences  literatures  we  have  examined 
previously  were  published  in  the  18th  and  19th  centuries,  the  earliest  document 
identified  in  the  present  Wireless  LANs  analysis  is  Shannon’s  classic  1948 
paper(s)  on  the  mathematical  theory  of  communication. 

Although the journals listed in Table A13-2 cover many aspects of communication,
three stand out in terms of numbers of seminal papers published. From 1948-1969
(with an outlier in 1996), the Bell System Technical Journal published eight of
these seminal papers. From 1975-1994 (with an outlier in 1998), the IEEE
Transactions on Communications published 19 of these seminal papers. From
1993-2000 (with an outlier in 1987), the IEEE Journal on Selected Areas in
Communications published 15 of these seminal papers.

Each  of  these  three  journals  commanded  essentially  a  separate  time  frame,  with 
each  time  frame  appearing  approximately  sequentially. 
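The per-journal counts and time frames quoted above can be derived mechanically from the YEAR and SOURCE columns of Table A13-2. The sketch below shows one way to do so. The function name `journal_spans` and the (year, source) tuple format are assumptions made for illustration, and the sample rows are only a small subset of the table, so the counts printed here are not the totals quoted in the text.

```python
from collections import defaultdict

def journal_spans(rows):
    """Map each source to (number of papers, earliest year, latest year).

    `rows` is a list of (year, source) tuples taken from a table such as
    Table A13-2 (hypothetical input format).
    """
    years_by_source = defaultdict(list)
    for year, source in rows:
        years_by_source[source].append(year)
    return {
        source: (len(years), min(years), max(years))
        for source, years in years_by_source.items()
    }

# A small subset of Table A13-2 rows, for illustration only
rows = [
    (1948, "BELL SYST TECH J"),
    (1960, "BELL SYST TECH J"),
    (1969, "BELL SYST TECH J"),
    (1975, "IEEE T COMMUN"),
    (1994, "IEEE T COMMUN"),
    (1987, "IEEE J SEL AREA COMM"),
    (1993, "IEEE J SEL AREA COMM"),
    (2000, "IEEE J SEL AREA COMM"),
]
spans = journal_spans(rows)
for source, (count, first, last) in spans.items():
    print(source, count, first, last)
```

Note that the SCI abbreviates sources inconsistently (for example, BELL SYST TECH J versus BELL LABS TECH J), so variant abbreviations would need to be merged by hand before counts like those in the text could be reproduced.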
APPENDIX  14  -  IMPORTANT  KEYWORDS


For  this  appendix,  6230  records  were  retrieved  covering  1991  to  mid-2005,  and  the 
highest  frequency  Keywords  were  extracted.  The  very  highest  frequency 
Keywords  are  quite  generic,  whereas  the  lowest  frequency  Keywords  on  this  list 
become  quite  specific.  These  data  allow  the  reader  to  generate  a  manual  taxonomy 
based  on  the  reader’s  perception  of  the  structure  of  the  Wireless  LANs  literature, 
with  the  high  frequency  phrases  defining  the  top  level  categories  and  the  low 
frequency  phrases  populating  the  pre-determined  categories.  These  Keyword  data 
also  allow  the  reader  to  construct  his/her  query  for  retrieving  selected  literatures  of
personal  interest. 
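As a rough illustration of how such a Keyword frequency list can be produced and then split into candidate top-level categories and more specific terms, consider the sketch below. The input format (one list of Keyword strings per record), the function names, and the cut-off value are all assumptions made for the example; the report leaves the taxonomy to the reader's judgment, not to a fixed threshold.

```python
from collections import Counter

def keyword_frequencies(records):
    """Count the number of records in which each Keyword appears.

    `records` is a list of Keyword lists, one per record (hypothetical
    input format). Keywords are upper-cased but otherwise left as written,
    so variants such as 'AD HOC NETWORKS' and 'AD-HOC NETWORKS' stay separate.
    """
    counts = Counter()
    for keywords in records:
        counts.update({k.strip().upper() for k in keywords})
    return counts.most_common()

def split_by_frequency(freq_table, top_level_min):
    """Separate generic high-frequency Keywords from specific low-frequency ones."""
    top = [k for k, n in freq_table if n >= top_level_min]
    rest = [k for k, n in freq_table if n < top_level_min]
    return top, rest

records = [
    ["Performance", "OFDM"],
    ["performance", "Bluetooth"],
    ["PERFORMANCE", "OFDM", "Scatternet"],
]
freq_table = keyword_frequencies(records)
top, rest = split_by_frequency(freq_table, top_level_min=2)
print(top)
print(sorted(rest))
```

The low-frequency Keywords in `rest` would then be assigned by hand to the categories suggested by the high-frequency Keywords in `top`.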


KEYWORD 

FREQ 

PERFORMANCE 

320 

SYSTEMS 

255 

WIRELESS  LAN 

252 

AD  HOC  NETWORKS 

227 

NETWORKS 

221 

CDMA 

220 

WIRELESS  NETWORKS 

204 

CAPACITY 

171 

CHANNELS 

161 

BLUETOOTH 

150 

SYSTEM 

132 

OFDM 

120 

WIRELESS 

115 

DIVERSITY 

105 

WLAN 

104 

FADING  CHANNELS 

101 

PERFORMANCE  ANALYSIS 

95 

DS-CDMA 

94 

WIRELESS  ATM 

93 

COMMUNICATION 

93 

IEEE  802.11 

91 

PROTOCOL 

91 

DESIGN 

89 

WIRELESS  COMMUNICATIONS 

89 

PERFORMANCE  EVALUATION 

88 

POWER  CONTROL 

88 

AD  HOC  NETWORK 

85 

ROUTING 

82 

PROTOCOLS 

78 

QUALITY  OF  SERVICE 

77 

TCP 

76 

ACCESS 

75 

CHANNEL  ESTIMATION 

74 
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RADIO  71 

QOS  69 

MOBILE  AD  HOC  NETWORKS  65 

THROUGHPUT  64 

MOBILE  RADIO  63 

MULTIPLE-ACCESS  61 

ARCHITECTURE  61 

CHANNEL  60 

ATM  60 

MOBILE  58 

MOBILE  COMPUTING  58 

SIMULATION  57 

EQUALIZATION  56 

WIRELESS  COMMUNICATION  55 

MOBILITY  54 

RECEIVER  54 

INTERNET  53 

MULTIUSER  DETECTION  53 

MOBILE  COMMUNICATIONS  52 

CODE  DIVISION  MULTIPLE  ACCESS  49 

ALGORITHM  48 

CODES  47 

WIRELESS  NETWORK  47 

HANDOFF  46 

PACKET  RADIO  NETWORKS  46 

WIRELESS  LANS  46 

TRANSMISSION  45 

MULTIMEDIA  45 

SPREAD-SPECTRUM  44 

CMOS  44 

MANET  43 

MODULATION  42 

MOBILE  IP  42 

SCHEDULING  40 

MULTICAST  40 

ALGORITHMS  40 

WIRELESS  AD  HOC  NETWORKS  39 

SPREAD  SPECTRUM  39 

POWER-CONTROL  39 

SENSE  MULTIPLE-ACCESS  39 

ANTENNAS  38 

MEDIUM  ACCESS  CONTROL  38 

CELLULAR  CDMA  38 

MODEL  38 

INTERFERENCE  SUPPRESSION  37 

MAC  37 

WIRELESS  LOCAL  AREA  NETWORK  (WLAN)  36 

MAC  PROTOCOL  36 
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TDMA 

36 

SECURITY 

35 

PROPAGATION 

35 

UMTS 

34 

MOBILITY  MANAGEMENT 

33 

SENSOR  NETWORKS 

33 

RADIO  CHANNELS 

32 

INTERFERENCE 

32 

AUTHENTICATION 

31 

TIME 

31 

SIGNALS 

31 

SYNCHRONIZATION 

31 

CODE  DIVISION  MULTIPLE  ACCESS  (CDMA) 

31 

CONGESTION  CONTROL 

30 

CELLULAR  RADIO  SYSTEMS 

30 

SERVICES 

30 

MULTIPATH  CHANNELS 

30 

AD  HOC 

29 

MULTIPLE  ACCESS 

28 

MEDIUM  ACCESS  CONTROL  (MAC) 

28 

MANAGEMENT 

28 

WLAN  ANTENNAS 

28 

NETWORK 

28 

SCHEME 

28 

AD-HOC  NETWORKS 

28 

FAIRNESS 

27 

MOBILE  AD  HOC  NETWORK 

27 

SMART  ANTENNAS 

27 

ETHERNET 

27 

DELAY 

27 

CALL  ADMISSION  CONTROL 

26 

TRANSMIT  DIVERSITY 

26 

MOBILE  COMMUNICATION 

26 

ADMISSION  CONTROL 

26 

ORTHOGONAL  FREQUENCY-DIVISION 
MULTIPLEXING  (OFDM) 

26 

MIMO 

26 

CODE-DIVISION  MULTIPLE  ACCESS  (CDMA) 

26 

RADIO  NETWORKS 

25 

MODELS 

25 

ROUTING  PROTOCOL 

25 

FADING 

24 

PHASE  NOISE 

24 

W-CDMA 

24 

ENERGY  EFFICIENCY 

24 

VOICE 

24 

ERROR  CONTROL 

24 

COMMUNICATION-SYSTEMS 

24 
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CELLULAR  NETWORKS 

24 

ORTHOGONAL  FREQUENCY  DIVISION 
MULTIPLEXING  (OFDM) 

24 

HIPERLAN/2 

23 

PACKET 

23 

TECHNOLOGY 

23 

BROADCAST 

23 

TRANSCEIVER 

23 

MC-CDMA 

22 

BAND 

22 

MOBILE  RADIO  SYSTEMS 

22 

FRONT-END 

22 

MULTIPATH 

22 

IP 

22 

CSMA/CD 

22 

LOCAL  AREA  NETWORKS 

21 

ANTENNA  ARRAYS 

21 

HIPERLAN 

21 

NOISE 

21 

ERROR 

21 

SLOTTED  ALOHA 

20 

SCALABILITY 

20 

RAKE  RECEIVER 

20 

ATM  NETWORKS 

20 

ANTENNA  DIVERSITY 

20 

ALLOCATION 

20 

MOBILE  NETWORKS 

20 

INTERFERENCE  CANCELLATION 

20 

CLUSTERING 

19 

DS/CDMA 

19 

WIRELESS  LOCAL  AREA  NETWORK 

19 

ARQ 

19 

BEHAVIOR 

19 

LAN 

19 

LINKS 

19 

PCS 

19 

QUALITY  OF  SERVICE  (QOS) 

19 

MOBILE  AD  HOC  NETWORK  (MANET) 

19 

HANDOVER 

18 

CODE-DIVISION  MULTIPLE  ACCESS 

18 

GHZ 

18 

RESOURCE  ALLOCATION 

18 

CAPTURE 

18 

MULTIPLE-ACCESS  CHANNELS 

18 

OPTIMIZATION 

18 

STABILITY 

18 

ANTENNA 

17 

MONOPOLE  ANTENNAS 

17 
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PERSONAL  COMMUNICATION 

17 

PACKET  RADIO 

17 

LINK 

17 

QUALITY-OF-SERVICE 

17 

VIDEO 

17 

BEAMFORMING 

17 

CSMA/CA 

17 

QUALITY-OF-SERVICE  (QOS) 

17 

MULTIHOP 

17 

SCHEMES 

17 

MICROSTRIP  ANTENNAS 

16 

SERVICE 

16 

ROUTING  PROTOCOLS 

16 

FADING  CHANNEL 

16 

PRMA 

16 

AD  HOC  WIRELESS  NETWORKS 

16 

ADAPTIVE  MODULATION 

16 

WPAN 

16 

MOBILE  ANTENNAS 

16 

REDUCTION 

16 

TRANSMITTER  DIVERSITY 

15 

MULTIPATH  FADING 

15 

SCATTERNET 

15 

TCP/IP 

15 

WIRELESS  LOCAL  AREA  NETWORKS 

15 

ASSIGNMENT 

15 

POWER  AMPLIFIER 

15 

QUALITY 

15 

COCHANNEL  INTERFERENCE 

15 

FREQUENCY 

15 

CSMA 

15 

TOPOLOGY  CONTROL 

15 

RF 

14 

CDMA  SYSTEM 

14 

SPECTRUM 

14 

LOW  POWER 

14 

WIRELESS  SENSOR  NETWORKS 

14 

ADAPTIVE  ARRAYS 

14 

RANDOM  ACCESS 

14 

MOBILE  AD  HOC  NETWORKS  (MANETS) 

14 

BROADCASTING 

14 

CELLULAR 

14 

ARRAY  SIGNAL  PROCESSING 

14 

802.11 

14 

PICONET 

14 

SIGNAL 

14 

RESERVATION  MULTIPLE  ACCESS 

14 

PERSONAL  COMMUNICATIONS 

14 
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MULTICARRIER  MODULATION 

14 

OPTICAL  WIRELESS 

14 

TRANSMIT  POWER  CONTROL 

13 

IEEE  802.1 1A 

13 

LOCATION  MANAGEMENT 

13 

ALOHA 

13 

DISTRIBUTED  ALGORITHMS 

13 

MULTIPATH  PROPAGATION 

13 

RECEIVERS 

13 

IEEE  802.11B 

13 

DIVERSITY  METHODS 

13 

RESERVATION  MULTIPLE-ACCESS 

13 

GPRS 

13 

ORTHOGONAL  FREQUENCY  DIVISION 
MULTIPLEXING 

13 

MAC  PROTOCOLS 

13 

DUAL-BAND  ANTENNAS 

13 

IMT-2000 

13 

PROBABILITY 

13 

ON-DEMAND  ROUTING 

13 

MULTICARRIER  CDMA 

13 

MIXER 

13 

GSM 

13 

CODE  DIVISION  MULTIACCESS 

12 

RAYLEIGH  FADING 

12 

SEQUENCES 

12 

AD-HOC  NETWORK 

12 

IDENTIFICATION 

12 

TRACKING 

12 

OFDM  SYSTEMS 

12 

INDOOR  RADIO  COMMUNICATION 

12 

DATA-TRANSMISSION 

12 

SPACE-TIME  CODING 

12 

ENERGY  CONSUMPTION 

12 

ACQUISITION 

12 

WIRELESS  LAN  (WLAN) 

12 

LOW-NOISE  AMPLIFIER  (LNA) 

12 

ORTHOGONAL  FREQUENCY-DIVISION 
MULTIPLEXING 

12 

POLLING 

12 

BANDWIDTH  ALLOCATION 

12 

CONNECTIVITY 

12 

ARRAY 

12 

TRANSPORT 

12 

REVERSE  LINK 

12 

INTERSYMBOL  INTERFERENCE 

12 

SPREAD  SPECTRUM  COMMUNICATION 

12 

FLOODING 

12 
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RSVP  12 

HYDROGEN  12 

POWER  MANAGEMENT  12 

WIRELESS  INTERNET  12 

ANTENNA-ARRAYS  11 

LANS  1 1 

RESOURCE  MANAGEMENT  11 

PRIORITY  11 

WCDMA  1 1 

RAKE  COMBINING  11 

ARRAYS  1 1 

CODE-DIVISION  MULTIPLE-ACCESS  (CDMA)  1 1 

QUEUEING  ANALYSIS  11 

DELAY  SPREAD  1 1 

CDMA  MOBILE  RADIO  11 

OUTAGE  PROBABILITY  11 

LOW-NOISE  AMPLIFIER  11 

DISTRIBUTIONS  11 

ALLOYS  1 1 

QAM  11 

FREQUENCY  OFFSET  11 

SENSOR  NETWORK  11 

PEAK  1 1 

POLARIZATION  1 1 

MULTIPLE-ACCESS  COMMUNICATIONS  10 

INFORMATION  10 

WIRELESS  ACCESS  10 

FEC  10 

ANTENNA  ARRAY  10 

UBIQUITOUS  COMPUTING  10 

MULTIPLE  ACCESS  PROTOCOLS  10 

RADIO  PROPAGATION  10 

WAP  10 

IEEE-802.11  PROTOCOL  10 

FLOW  CONTROL  10 

FILTER  10 

VIDEO  TRANSMISSION  10 

MODELING  10 

MULTIPATH  FADING  CHANNELS  10 

INTEGRATION  10 

UPLINK  10 

ADAPTIVE  ANTENNAS  10 

TCP  PERFORMANCE  10 

WIRELESS  ATM  NETWORKS  1 0 

3G  10 

CELLULAR  NETWORK  10 

SOLUBILITY  10 

MHZ  10 
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BUILDINGS  10 

PERMEATION  10 

SOFT  HAN  DOFF  10 

MOBILE  AD-HOC  NETWORKS  1 0 

STEEL  10 

CDMA  SYSTEMS  10 

PREDICTION  10 

NETWORK  CAPACITY  9 

INVERTED-F  ANTENNA  9 

MULTICASTING  9 

SLOT  ANTENNA  9 

POWER  SAVING  9 

ACCESS  PROTOCOLS  9 

WIRELESS  VIDEO  9 

CODE  ACQUISITION  9 

ASYNCHRONOUS  TRANSFER  MODE  9 

SYNTHESIZER  9 

FREQUENCY  SYNTHESIZER  9 

INDUCTORS  9 

LOW  VOLTAGE  9 

OPTICAL  COMMUNICATION  9 

POWER  CONSUMPTION  9 

RADIO  SYSTEMS  9 

TRANSPORT  PROTOCOLS  9 

FREQUENCY  SYNTHESIZERS  9 

DIVISION  MULTIPLE-ACCESS  9 

CHANNEL  ASSIGNMENT  9 

SERVICE  DISCOVERY  9 

INTERFERENCE  STATISTICS  9 

INTEGRATED  SERVICES  9 

RANDOM  SIGNATURE  SEQUENCES  9 

ENERGY  SAVING  9 

STATISTICAL-MODEL  9 

IEEE802.1 1  9 

JOINT  DETECTION  9 

LINK  ADAPTATION  9 

IEEE-802.11  9 

MOBILE  WIRELESS  NETWORKS  9 

SCATTERNET  FORMATION  9 

MULTIMEDIA  COMMUNICATION  9 

IMPLEMENTATION  9 

RESERVATION  9 

MULTICARRIER  9 

MOBILE  RADIOCOMMUNICATION  9 

LAND  MOBILE  RADIO  CELLULAR  SYSTEMS  9 

PACKET  SCHEDULING  9 

MULTIPLE  ACCESS  CONTROL  9 

AODV  9 
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MULTICAST  ROUTING  9 

FILTERS  9 

CONVERGENCE  9 

BROADBAND  9 

MICROSTRIP  ANTENNA  9 

LOCATION  8 

MODULATOR  8 

MOBILE  INTERNET  8 

KEY  EXCHANGE  8 

SOFTWARE  RADIO  8 

SYSTEM  CAPACITY  8 

MMIC  8 

PERSONAL  COMMUNICATION  NETWORKS  8 

60  GHZ  8 

MULTIACCESS  COMMUNICATION  8 

MEDIUM  ACCESS  8 

COMPRESSION  8 

ACCESS  CONTROL  8 

DECOMPOSITION  8 

FORWARD  ERROR  CORRECTION  8 

RF  TRANSCEIVER  8 

SPREAD-SPECTRUM  COMMUNICATION  8 

DOMINATING  SETS  8 

SPREAD-SPECTRUM  COMMUNICATIONS  8 

MATCHED  FILTER  8 

PRINTED  ANTENNAS  8 

FUZZY  LOGIC  8 

MULTIHOP  RADIO  NETWORKS  8 

AMPLIFIER  8 

DIRECT  CONVERSION  8 

WIRELESS  APPLICATION  PROTOCOL  8 

DIRECT  SEQUENCE  8 

MANET  STEEL  8 

TRANSMISSION  POWER  CONTROL  8 

LOAD  BALANCING  8 

MOBILE  NETWORKING  8 

H.263  8 

POWER  8 

GRAPHS  8 

BANDWIDTH  8 

INTERWORKING  7 

AVERAGE  POWER  RATIO  7 

MULTIPATH  ROUTING  7 

NETWORK  PROTOCOLS  7 

COLLISION  AVOIDANCE  7 

MIXERS  7 

PATCH  ANTENNA  7 

PRIVACY  7 
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PARTIAL  TRANSMIT  SEQUENCES 

7 

RADIO  RECEIVERS 

7 

CELLULAR  SYSTEM 

7 

MOBILITY  MODEL 

7 

MULTIUSER  DETECTORS 

7 

HYBRID  ARQ 

7 

(OFDM) 

7 

INDOOR 

7 

WIRELESS  MULTIMEDIA  COMMUNICATIONS 

7 

ERROR  RESILIENCE 

7 

TURBO  CODES 

7 

802.11  A 

7 

BLIND  EQUALIZATION 

7 

BICMOS 

7 

HETEROGENEOUS  NETWORKS 

7 

OSCILLATOR 

7 

PARAMETERS 

7 

RELIABLE  MULTICAST 

7 

CIRCUITS 

7 

WIRELESS  APPLICATION  PROTOCOL  (WAP) 

7 

SPACE  DIVERSITY 

7 

ENVIRONMENT 

7 

PHYSICAL  LAYER 

7 

DUAL-BAND 

7 

HOME  NETWORK 

7 

ARQ  PROTOCOLS 

7 

SPREADING  SEQUENCES 

7 

MULTIMEDIA  COMMUNICATIONS 

7 

PERVASIVE  COMPUTING 

7 

TIME  DIVISION  DUPLEX 

7 

UDP 

7 

MEDIA  ACCESS  CONTROL 

7 

IRDA 

7 

WIRELESS  TCP 

7 

DIVERSITY  RECEPTION 

7 

PACKET  RADIO  NETWORK 

7 

CSMA  CD 

7 

CHANNEL  CAPACITY 

7 

MOBILITY  MODELS 

7 

DEMODULATION 

7 

CAPTURE  EFFECT 

7 

IEEE  802.1  IE 

7 

SLOTTED-ALOHA 

7 

DETECTORS 

7 

SENSE  MULTIPLE  ACCESS 

7 

MOBILE  NETWORK 

7 

VCO 

7 

RF  CMOS 

7 
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INTERFACE  7 

DIFFERENTIAL  DETECTION  7 

VERIFICATION  7 

ADAPTIVE  ARRAY  7 

IPV6  7 

MOBILE  RADIO  COMMUNICATION  7 

MOBILE  COMMERCE  7 

WIDE-BAND  CDMA  7 

CMOS  INTEGRATED  CIRCUITS  7 

WIDEBAND  CDMA  7 

MULTI-HOP  7 

CODE-DIVISION  MULTIPLE-ACCESS  7 

MICROCELL  7 

MIMO  SYSTEMS  7 

CHANNEL  ALLOCATION  7 

HETEROGENEOUS  NETWORK  6 

STBC  6 

COMPUTER  NETWORKS  6 

RECEPTION  6 

ITERATIVE  DECODING  6 

APPROXIMATION  ALGORITHMS  6 

EQUALIZERS  6 

MULTI-PATH  6 

RATE  CONTROL  6 

FREQUENCY-SYNTHESIZER  6 

BLIND  CHANNEL  ESTIMATION  6 

4G  6 

STATISTICS  6 

SYSTEM  DESIGN  6 

SPECTRAL  EFFICIENCY  6 

MEDIUM  ACCESS  CONTROL  PROTOCOL  6 

REAL-TIME  6 

WIRELESS  SYSTEMS  6 

HIERARCHICAL  ROUTING  6 

WIRELESS  SECURITY  6 

REJECTION  6 

REROUTING  6 

GAME  THEORY  6 

RETRANSMISSION  6 

PATCH  ANTENNAS  6 

PB-17LI  6 

UWB  6 

COLLISION  RESOLUTION  6 

RAYLEIGH  FADING  CHANNELS  6 

MULTIMEDIA  TRAFFIC  6 

CONVOLUTIONAL-CODES  6 

GATEWAY  6 

RADIO  CHANNEL  6

TRANSMITTER  POWER  CONTROL  6

VOLTAGE-CONTROLLED  OSCILLATOR  (VCO)  6

GMSK  6

QOS  ROUTING  6

METAL-PLATE  ANTENNAS  6

DECT  6

CODE-DIVISION  MULTIACCESS  6

TRANSMITTER  6

SEQUENCE  SPREAD-SPECTRUM  6

RADIO  COMMUNICATION  6

CELLULAR  SYSTEMS  6

AD  HOC  NETWORKING  6

MOBILITY  MODELING  6

MPEG-4  6

BARRIERS  6

SIMULATED  ANNEALING  6

BLUETOOTH  SCATTERNET  6

MARTENSITIC  STEEL  6

GPS  6

TRANSMISSION  CONTROL  PROTOCOL  6

MONOPOLE  ANTENNA  6

NETWORK  MANAGEMENT  6

PETRI  NETS  6

SMART  ANTENNA  6

WIRELESS  LINKS  6

MONOLITHIC  MICROWAVE  INTEGRATED  CIRCUIT  (MMIC)  6

WIRELESS  AD  HOC  NETWORK  6

MULTIBAND  6

PACKET  SWITCHING  6

INTERNET  ACCESS  6

FRAMEWORK  6

OSCILLATORS  6

TELEMEDICINE  6

MULTIPLE-ACCESS  COMMUNICATION  6

DISTRIBUTED  ALGORITHM  6

MOBILE  MULTIMEDIA  6

INTERFERENCE  CANCELLER  6

SUPPRESSION  6

SPACE-TIME  CODES  6

TRANSCEIVERS  6

DCF  6

GLOBAL  POSITIONING  SYSTEM  (GPS)  6

MARKOV  CHAIN  6

BROADBAND  PACKET  WIRELESS  ACCESS  6

PACKET  NETWORKS  6

WIRELESS  LOCAL  AREA  NETWORK  (LAN)  6

GUARANTEED  DELIVERY  6

VOIP  6

IRRADIATION  6

PIFA  6

MULTIPLE-INPUT  MULTIPLE-OUTPUT  (MIMO)  6

NONLINEAR  DISTORTION  6

BER  6

EVOLUTION  6

MODEL  CHECKING  5

BANDPASS  FILTER  5

COATINGS  5

TRANSMISSION  CONTROL  PROTOCOL  (TCP)  5

TRAFFIC  CONTROL  5

BACKOFF  5

MILLIMETER  WAVES  5

CLUSTER  5

SIGNAL  PROCESSING  5

MOBILE  ROBOTS  5

TRAFFIC  MANAGEMENT  5

BROADBAND  WIRELESS  ACCESS  5

MAI  5

CMOS  TRANSCEIVER  5

CAPACITY  ANALYSIS  5

CANCELLATION  5

RESOURCE  RESERVATION  5

CONTEXT  AWARENESS  5

LOCAL-AREA  NETWORKS  5

CMOS  ANALOG  INTEGRATED  CIRCUITS  5

TRAFFIC  ENGINEERING  5

CELL  SEARCH  5

HIPERLAN2  5

MAXIMUM-LIKELIHOOD  5

CELLULAR  RADIO  5

COLLISION  DETECTION  5

UNEQUAL  ERROR  PROTECTION  5

SECURE  ROUTING  5

COFDM  5

SPACE  DIVISION  MULTIPLE  ACCESS  5

CONCEALMENT  5

COMMUNICATION  SYSTEMS  5

MULTI-CARRIER  CDMA  5

SUPPORT  5

COMMUNICATION-NETWORKS  5

MMSE  5

GENERATION  5

MICRO-MOBILITY  5

BLOCKING  PROBABILITY  5

SEQUENCE  5

TECHNOLOGIES  5

LOW-POWER  5

CHANNEL  MODEL  5

BASE  STATION  5

BLAST  5

IMPERFECT  POWER-CONTROL  5

STAINLESS-STEEL  5

CO-CHANNEL  INTERFERENCE  5

BLUETOOTH  TECHNOLOGY  5

SOFTWARE  DEFINED  RADIO  5

TENSILE  5

COEXISTENCE  5

CATV  5

MULTICODE  5

CARE  5

WIRELESS  PERSONAL  AREA  NETWORK  (WPAN)  5

PARAMETER  ESTIMATION  5

INFRARED  COMMUNICATIONS  5

ADAPTIVE  ANTENNA  ARRAY  5

PARALLEL  INTERFERENCE  CANCELLATION  5

DIPOLE  ANTENNAS  5

OPERATION  5

WI-FI  5

WIRELESS  PERSONAL  AREA  NETWORKS  5

DIRECT  SEQUENCE  SPREAD  SPECTRUM  5

PAPR  5

MULTIRATE  CDMA  5

MULTIPLE  ACCESS  INTERFERENCE  5

OFFSET  5

LINK  CAPACITY  5

IS-95  5

FREQUENCY  HOPPING  5

REAL-TIME  COMMUNICATION  5

LINK  LAYER  5

WIRELESS  NETWORKING  5

ESPRIT  5

FAULT  TOLERANCE  5

WIRELESS  ASYNCHRONOUS  TRANSFER  MODE  (ATM)  5

PACKET  TRANSMISSION  5

INTERCARRIER  INTERFERENCE  5

DISTORTION  5

INFORMATION  DISSEMINATION  5

DISTRIBUTED  COMPUTING  5

QUEUEING  THEORY  5

NETWORK  ARCHITECTURE  5

WATM  5

ENVIRONMENTS  5

VOLTAGE-CONTROLLED  OSCILLATORS  (VCOS)  5

WEB  5

LEADER  ELECTION  5

DECISION-FEEDBACK  EQUALIZATION  5

RADIO  RESOURCE  MANAGEMENT  5

INTEROPERABILITY  5

DFE  5

VOICE-DATA  INTEGRATION  5

INTERNETWORKING  5

ADAPTIVE  EQUALIZERS  5

WLANS  5

DIFFUSION  5

FREQUENCY  DIVERSITY  5

RAKE  RECEIVERS  5

FRAME  SYNCHRONIZATION  5

PEAK-TO-AVERAGE  POWER  RATIO  5

DYNAMIC  CHANNEL  ASSIGNMENT  5

VLSI  5

ADAPTIVE  BEAMFORMING  5

WIRELESS  PROTOCOLS  5

DYNAMIC  BANDWIDTH  ALLOCATION  5

WLAN  OPERATION  5

DATA  LINK  LAYER  5

DIGITAL  MODULATION  5

LINEARITY  5

REACTIVE  ROUTING  5

ADAPTIVE  EQUALIZATION  5

WIRELESS  LOCAL  AREA  NETWORKS  (WLAN)  5

CROSS-LAYER  5

KALMAN  FILTER  5

ENERGY  5

ACCESS-POINT  ANTENNAS  5

OFDMA  5

MULTIMEDIA  STREAMING  5

DIVERSITY  ANTENNAS  5

WIRELESS  LAN  STANDARDS  5

RELIABILITY  5

DISTRIBUTED  SYSTEM  5

CORRELATED  FADING  5

WIRELESS  LOCAL  AREA  NETWORKS  (WLANS)  5

EXHAUSTIVE  SERVICE  SCHEDULING  5

CORRELATION  5

MULTIPLE  ACCESS  PROTOCOL  5

ACCESS  POINT  5

AAA  5

PLANAR  MONOPOLE  ANTENNAS  5

FAIR  SCHEDULING  5

OUTAGE  5

PHASE-LOCKED  LOOP  (PLL)  5

RECOVERY  5

PHASE-LOCKED  LOOPS  5

CRYPTANALYSIS 